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FIELD OF THE INVENTION ] 

The present invention relates generally to compositions and methods for the regulation of 
steroidogenesis. More particularly, it concerns compositions and methods relating to the 
regulation of transport of cholesterol into the mitochondria of a cell for the synthesis of 
10 androgens, estrogens, glucocorticoids, mineralocorticoids, and progestagens. The invention also 
1 relates to methods for detecting and treating steroid hormone-dependent disorders. The nucleic 
t acid molecules of the present invention also provide methods for screening a sample for steroid 

hormone-dependent disorders, as well as to methods for preparing recombinant proteins for 
1 StAR. The invention further relates to the field of nucleic acid probes and primers, as the various 
jf nucleic acid molecules of the invention may be used as molecular probes in all of the 
R aforedescribed methods, as well as primers for amplifying particular sequences of interest 

O BACKGROUND OF THE INVENTION 

° The testis is known to be the source of circulating androgens that are responsible for the 

20 maintenance of the secondary sexual characteristics in the male. In most species the testis has 
two separate compartments: the seminiferous tubules that contain the Sertoli cells, the 
peritubular cells, and the germ cells; and the interstitial compartment that contains the Leydig 
cells, macrophages, lymphocytes, granulocytes and the cells composing the blood, nerve and 
lymphatic structures. 



25 



The Leydig cells, located in the interstitial compartment and comprising approximately 2- 
3% of the total testicular cell number in most species, are the only cells in the testis that contain 
two key steroidogenic enzymes, namely, cytochrome P450 side chain cleavage (P450scc) and 3 
beta-hydroxysteroid dehydrogenase (3 beta-HSD). Thus, Leydig cells are the only testicular 
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cells capable of the first two steps in steroidogenesis; i) the conversion of cholesterol, the 
substrate for all steroid hormones, to pregnenolone; and ii) conversion of pregnenolone to 
progesterone. Therefore, the interstitial compartment in general, and the Leydig cells in 
particular synthesize virtually all of the steroids produced in the testis, with testosterone being 
5 the major steroid biosynthesized. 

The major stimulus for the biosynthesis of testosterone in the Leydig cell is the 
gonadotropic hormone, luteinizing hormone (LH). LH is secreted from specific cells located in 
the anterior pituitary and it interacts with specific receptors on the surface of the Leydig cell and 

1 0 initiates the signal for testosterone production. Cellular events occur rapidly in response to the 
trophic hormone stimulation of Leydig cells, and result in the synthesis and secretion of 

yp testosterone. These rapid or acute effects of hormone stimulation occur within minutes and can 
(\ be distinguished temporally from the slower chronic effects that occur on the order of many 

hours and that involve mechanisms to increase gene transcription and translation of the steroid 
§5 hydroxylase cytochrome P450 enzymes involved in the biosynthesis of these steroids. 

p* The rate-limiting enzymatic step in steroidogenesis is the conversion of cholesterol to 

M pregnenolone by the cholesterol side-chain cleavage complex (CSCC) which is localized to the 
pi mitochondrial inner membrane (Stone and Hechter, 1954; Karaboyas and Koritz, 1965; Simpson, 

11 et al 1972). However, the delivery of the substrate cholesterol from cellular stores and the outer 
mitochondrial membrane to the inner mitochondrial membrane and the CSCC is the true 
regulated, rate-limiting step in this process (Crivello and Jefcoate, 1980; Jefcoate, et al, 1987). 
Cycloheximide, an inhibitor of protein synthesis, blocks the hormone-induced steroid production 
in two steroidogenic tissues of the rat; the adrenal and testis (Ferguson, 1963; Garren, et al, 

25 1965; Davis and Garren, 1968; Jefcoate et al, 1974; Mendelson etal, 1975; Cooke, et al, 
1 975). This block is at the point of transfer of cholesterol from the outer to the inner 
mitochondrial membrane and the CSCC (Simpson et al, 1972;Privallee/a/. 1983). Therefore, 
acute regulation of steroidogenesis requires de novo protein synthesis (Jefcoate et al, 1986). 
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During protein import into the mitochondrial matrix, the inner and outer mitochondrial 
membranes become closely associated and form protein translocation "contact sites" (Schleyer 
and Neupert, 1985; Schwaiger etaL, 1987; Glick, etal, 1991). Phospholipids are transferred 
from the outer mitochondrial membrane to the inner mitochondrial membrane at these membrane 
5 "contact sites" (Simbeni etaL, 1990; Simbeni etaL, 1991; Ardail etaL, 1991). Therefore, the 
intramitochondrial cholesterol translocation required for steroidogenesis may also occur at 
membrane contact sites. An increase in intramitochondrial membrane contacts by a hormone- 
dependent, cycloheximide-sensitive mechanism may regulate cholesterol transport to the CSCC 
(Jefcoate, et aL, 1986). Thus, in the acute regulation of steroidogenesis, a putative function for 
1 0 the newly synthesized regulatory protein may be to facilitate the formation of mitochondrial 

contact sites that would result in an increased rate of transfer of cholesterol to the inner 
3 membrane and CSCC which ultimately would result in the observed increase in the rate of 
f l steroid production. However, the search for these cycloheximide-sensitive regulatory protein(s) 
HI has been ongoing for nearly 30 years, but, as yet, the mechanism of cholesterol transfer to the 
5 CSCC is not known. 

H The present inventors have previously identified a family of hormone-induced 

M mitochondrial proteins in MA- 1 0 cells that regulate cholesterol delivery to the inner 
n mitochondrial membrane and the CSCC. These proteins have been described as the 
W mitochondrial 37kDa, 32kDa, and 30kDa molecular weight proteins and they are synthesized in 
response to either LH and hCG or by stimulation with the CAMP analogue, B^cAMP (Stocco 
and Kilgore, 1988). The 30kDa species consists of four separate proteins and proteolytic 
digestion of all four forms indicates that they are all modified forms of the same protein (Stocco 
and Chen, 1991), Pulse chase experiments and tryptic peptide mapping of the 37kDa and 30kDa 
25 proteins indicated that the 37kDa form is a precursor to the 30kDa protein (Stocco and Sodeman, 
1991 ; Epstein and Orme-Johnson, 1991). These reports, however, lack information regarding the 
structure of the nucleic acid molecules and protein molecules involved in steroidogenesis. 
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Lipoid congenital adrenal hyperplasia (LC AH) is a lethal autosomal recessive disease that 
results in a complete inability of a newborn infant to synthesize steroids. The lack of 
mineralocorticoids and glucocorticoids results in death within days to weeks of birth if not 
detected and treated with adequate steroid hormone replacement therapy. This condition is 
5 manifested by the presence of large adrenals containing very high levels of cholesterol and 
cholesterol esters and also by an increased amount of lipid accumulation in testicular Ley dig 
cells, though this level is somewhat lower than that seen in adrenals. As isolated, mitochondria 
from adrenals and gonads of affected patients cannot convert cholesterol to pregnenolone 
(Camacho et al., 1968; Degenhart et a/., 1972; Koizumi et al, Hauffa et al, 1985). The P450scc 

10 enzyme that converts cholesterol to pregnenolone has been shown to be normal in patients who 
suffered from this disease (Lin et al, 1991). Thus, the defect lies upstream of P450scc at the 

% point of cholesterol delivery to the enzyme. 

m Prior art lacks sufficient identification of the agent(s) responsible for the LCAH 

Jf> metabolic defect and defects in cholesterol transport, lacks screening methods for their detection, 
^ and lacks provision of pharmacological agents effective in alleviating the defects. Because of 
CI these problems, known procedures are not completely satisfactory despite efforts of persons 
U skilled in the art, and the present inventors have searched for improvements. Further 
±! characterization of agents involved in these defects at the amino acid and nucleic acid levels 
3D would provide potential solutions and alternatives to resolving these and other problems in the 
art. 

SUMMARY OF THE INVENTION 

The present invention seeks to overcome these and other drawbacks inherent in the prior 
25 art by providing compositions and methods including the nucleotide sequence of the gene 

encoding a steroidogenic acute regulatory protein (StAR) protein, or compositions that include a 
protein having the amino acid sequence of the StAR protein. The fundamental importance of the 
StAR protein is that it is the acute regulator of a key step in the steroidogenic biosynthetic 
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pathway. Importantly, the production of mineralocorticoids, glucocorticoids, and sex hormones 
are dependent on the expression of this protein. 

Nucleic acid molecules having nucleotide sequences of the gene encoding StAR may be 
5 used in a variety of different diagnostic applications, including the evaluation of gene defects 
associated with steroid hormone production. The hormonally induced, CAMP-mediated acute 
regulation of steroid hormone biosynthesis in steroidogenic cells is characterized by the 
mobilization of cholesterol from cellular stores to the mitochondria outer membrane, and its 
translocation to the inner membrane where the conversion of cholesterol to pregnenolone occurs. 
1 0 Steroid hormone-dependent disorders that may be addressed using compositions and methods of 
the present invention include lipoid congenital adrenal hyperplasia, infertility, sexual maturation, 
J3 androgen-responsive tumors, precocious puberty, McCune- Albright syndrome, adrenal- 
71 hypoplasia congenita, or hypogonadotropic hypogonadism. 

J|5 Further, in pregnancy induced diabetes, progesterone levels are lower than normal and the 

T fetus may be aborted spontaneously. The level of StAR protein may be deficient in these 
J3 patients and it may be possible as a result of the present invention to monitor levels of StAR in 
H pregnancy for predicting patients that may be at risk for spontaneous abortion. 

^#0 The nucleic acid molecules of the invention may further be used to provide recombinant 

preparations of the StAR protein. These highly purified preparations can also be provided in 
relatively high yield in conjunction with techniques well known to those of skill in the art of 
recombinant technology* 

25 In certain aspects, the invention relates to a purified nucleic acid molecule having a 

nucleotide sequence encoding a steroidogenic acute regulatory protein, the protein having an 
amino acid sequence as set forth in SEQ ID NO: 2 or SEQ ID NO: 18, or sequences that are about 
80% to about 99% identical to that sequence. "Purified" nucleic acid molecules having a 
nucleotide sequence encoding steroidogenic acute regulatory protein (StAR), in some 
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embodiments of the invention, means a StAR encoding nucleic acid molecule substantially free 
of nucleic acid molecules not encoding an amino acid sequence having about 80% to about 99% 
identity to the sequence set forth in SEQ ID NO:2 or SEQ ID NO: 18. A further embodiment of 
the invention is a purified nucleic acid molecule having a nucleotide sequence encoding a 
5 steroidogenic acute regulatory protein antigen, the antigen having an amino acid sequence having 
about 80% to about 99% identity to the sequence set forth in SEQ ID NO: 8, and the nucleic acid 
molecule being substantially free of nucleic acid molecules not encoding the steroidogenic acute 
regulatory protein antigen. 

1 0 The term "essentially as set forth in SEQ ID NO:2, SEQ ID NO: 1 8 or SEQ ID NO:8 M 

means that the sequence substantially corresponds to a portion of the indicated sequence and has 
5 relatively few amino acids which are not identical to, or a biologically functional equivalent of, 
f! the amino acids set forth. The term "biologically functional equivalent" is well understood in the 
W art and is further defined as a protein having a sequence essentially as set forth in the indicated 
1J5 SEQ ID NO, and that is involved in the transfer of cholesterol from cellular stores to the inner 
; mitochondrial membrane. Accordingly, sequences which have between about 70% and about 
D 80%; or more preferably, between about 8 1 % and about 90%; or even more preferably, between 

about 9 1 % and about 99%; of amino acids that are identical or functionally equivalent to the 
5 amino acids of the indicated SEQ ID NO:2, 18, or 8 will be sequences which are "essentially as 
W set forth in SEQ ID NO:2, 18, or 8". 

Some embodiments of the above-described nucleic acid molecule are those wherein the 
steroidogenic acute regulatory protein has the amino acid sequence of SEQ ID NO: 2 or 18. In 
further defined embodiments, the invention provides the above-described nucleic acid molecule 
25 wherein the amino acid sequence begins with amino acid methionine at position 48 of SEQ ID 

NO:2 and extends through amino acid cysteine at position 284 of SEQ ID NO:2. Amino acids 1- 
47 constitute the signal sequence which is cleaved during processing to the mature protein as 
described in Example 2. 
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A further embodiment of the present invention is where the nucleic acid molecule has a 
nucleotide sequence as set forth in SEQ ID NO:l or 19, and preferably, the nucleic acid molecule 
is further defined as including a detectable label. 



5 Nucleic Acids 

Some embodiments of the present invention provide purified nucleic acid molecules that 
encode StAR protein having an amino acid sequence essentially as set forth in SEQ ID NO:2 or 
1 8. As used herein, the terms "nucleic acid molecule" may refer to a DNA or RNA molecule 
which has been isolated free of total genomic DNA, or free of total RNA, of a particular species. 
10 Therefore, a "purified" nucleic acid molecule as used herein, refers to a nucleic acid molecule 
^ that contains a StAR coding sequence yet is isolated away from, or purified free from, total 
H genomic DNA or total RNA, for example, total human genomic DNA. Included within the tertn 
jM* "DNA molecule", are DNA segments and smaller fragments of such segments, and also 
m recombinant vectors, including, for example, plasmids, cosmids, phage, viruses, and the like. 

.f ■ 

Another embodiment of the present invention is a purified nucleic acid molecule, further 
%: defined as including a nucleotide sequence in accordance with SEQ ID NO:l or 19. In some 
~ embodiments, the purified nucleic acid segment comprises a nucleotide sequence having about 
O 75% to about 99% identity with the sequence at SEQ ID NO: 1 or with the StAR coding 
20 sequences thereof (see FIG. 3). Such nucleotide sequences are more particularly defined as being 
substantially free of nucleic acids not encoding the StAR protein. 

Similarly, a DNA molecule comprising an isolated or purified StAR gene refers to a 
DNA molecule including StAR coding sequences isolated substantially away from other 
25 naturally occurring genes or protein encoding sequences. In this respect, the term "gene" is used 
for simplicity to refer to a functional protein, polypeptide or peptide encoding unit As will be 
understood by those in the art, this functional term includes both genomic sequences, cDNA 
sequences or combinations thereof. "Isolated substantially away from other coding sequences" 
means that the gene of interest, in this case the StAR encoding gene, forms the significant part of 
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the coding region of the DNA molecule, and that the DNA molecule does not contain large 
portions of naturally-occurring coding DNA, such as large chromosomal fragments or other 
functional genes or cDNA coding regions. Of course, this refers to the DNA molecule as 
originally isolated, and does not exclude genes or coding regions later added to the segment by 
5 the hand of man. 

Another embodiment of the present invention is a purified nucleic acid molecule that 
encodes a protein having a sequence with about 80% to about 99% identity to SEQ ID NO:2 or 
1 8, further defined as a recombinant vector. As used herein, the term "recombinant vector", 

10 refers to a vector that has been modified to contain a nucleic acid segment that encodes a StAR 
protein, or fragment of interest thereof. The recombinant vector may be further defined as an 

yg expression vector comprising a promoter operatively linked to said StAR encoding nucleic acid 
molecule. In some embodiments, the recombinant vector comprises a nucleic acid sequence in 
[U accordance with SEQ ID NO: 1 or 19, or with about 80% to about 99% identity to the sequence 
|5 of SEQ ID NO: 1 or 1 9. By way of example and not limitation, vectors may be further defined as 
T a pCMV, pUC and derivatives thereof, SV40, adenoviral, retroviral, yeast plasmids, Baculovirus 
^ or Vaccinia virus vector. 

A further preferred embodiment of the present invention is a host cell, made recombinant 

11 with a recombinant vector comprising a StAR gene. The recombinant host cell may be a 
prokaryotic or a eukaryotic cell. In a more preferred embodiment, the recombinant host cell is a 
eukaryotic cell. As used herein, the term "engineered" or "recombinant" cell is intended to refer 
to a cell into which a recombinant gene, such as a gene encoding StAR, has been introduced. 
Therefore, engineered cells are distinguishable from naturally occurring cells which do not 

25 contain a recombinant introduced gene. Thus, engineered cells are cells having a gene or genes 
introduced through the hand of man. Recombinant^ introduced genes will either be in the form 
of a cDNA gene (i.e., they will not contain introns), a copy of a genomic gene, or will include 
genes positioned adjacent to a promoter not naturally associated with the particular introduced 
gene, or combinations thereof. Preferred host cells may be further defined as a Leydig cell 



(primary or MA- 10 cells), an adrenalcortical cell such as the H295 human adrenalcortical cell 
line, a primary culture ovarian granulosa cell, a COS cell, Saccharomyces cerevisiae, or 
Escherichia coli cell. 

Generally speaking, it may be more convenient to employ as the recombinant gene a 
cDNA version of the gene. It is believed that the use of a cDNA version will provide advantages 
in that the size of the gene will generally be much smaller and more readily employed to 
transfect the targeted cell than will a genomic gene, which will typically be up to an order of 
magnitude larger than the cDNA gene. However, the inventors do not exclude the possibility of 
employing a genomic version of a particular gene where desired. 

In certain embodiments, the invention concerns isolated DNA molecules and recombinant 
vectors which encode a protein or peptide that includes within its amino acid sequence an amino 
acid sequence essentially as set forth in SEQ ID NO:2 or 18. Naturally, where the DNA segment 
or vector encodes a full length StAR protein, or is intended for use in expressing the StAR 
protein, the most preferred sequences are those which are essentially as set forth in SEQ ID NO:2 
or 18. 

In certain other embodiments, the invention concerns isolated DNA segments and 
recombinant vectors that include within their sequence a nucleic acid sequence essentially as set 
forth in SEQ ID NO: 1 or 19. The term "essentially as set forth in SEQ ID NO: 1 or 19", is used 
in the same sense as described above and means that the nucleic acid sequence substantially 
corresponds to a portion of SEQ ID NO: 1 or 1 9, and has relatively few codons which are not 
identical, or functionally equivalent, to the codons of SEQ ID NO: 1 or 19. The term 
"functionally equivalent codon" is used herein to refer to codons that encode the same amino 
acid, such as the six codons for arginine or serine, as set forth in Table 1, and also refers to 
codons that encode biologically equivalent amino acids. 
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It will also be understood that amino acid and nucleic acid sequences may include 
additional residues, such as additional N- or C-terminal amino acids or 5' or 3* sequences, and yet 
still be essentially as set forth in one of the sequences disclosed herein, so long as the sequence 
meets the criteria set forth above, including the maintenance of biological protein activity where 
5 protein expression is concerned. The addition of terminal sequences particularly applies to 

nucleic acid sequences which may, for example, include various non-coding sequences flanking 
either of the 5' or 3' portions of the coding region or may include various internal sequences, i.e., 
introns, which are known to occur within genes. The nucleic acid segments of the present 
invention, regardless of the length of the coding sequence itself, may be combined with other 

1 0 DNA sequences, such as promoters, poiyadeny iation signals, additional restriction enzyme sites, 
multiple cloning sites, other coding segments, and the like, such that their overall length may 
vary considerably. 

Ill Excepting intronic or flanking regions, and allowing for the degeneracy of the genetic 

15 code, sequences which have between about 70% and about 90%; or more preferably, between 
* about 80% and about 90%; or between about 80% and about 99%; of nucleotides which are 
D identical to the nucleotides of SEQ ID NO:l or 19 will be sequences which are "essentially as set 

11 forth in SEQ ID NO: 1 or 19". Sequences which are essentially the same as those set forth in 
y SEQ ID NO: 1 or 19 may also be functionally defined as sequences which are capable of 

90 hybridizing to a nucleic acid segment containing the complement of SEQ ID NO: 1 or 19 under 
relatively stringent conditions. Suitable relatively stringent hybridization conditions will be well 
known to those of skill in the art and are clearly set forth herein, for example conditions for use 
with Northern blot analysis, and as described in the various embodiments of the invention 
provided herein and particularly in Example 2. 
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s 


Asparagine 


Asn 


XT 

N 


A AP 

AAU 


A ATT 

AAU 












Proline 


rro 


p 


CPA 


ccc 


CCG 


ecu 








Glutamine 


/"•I.. 

(jln 


Q 


PA A 
LAA 


PAG 












Arginine 


Arg 


R 


AGA 


AGG 


CGA 


CGC 


CGG 


CGU 




Serine 


Ser 


S 


AGC 


AGU 


UCA 


UCC 


UCG 


ucu 


20 


Threonine 


Thr 


T 


ACA 


ACC 


ACG 


ACU 








Valine 


Val 


V 


GUA 


GUC 


GUG 


GUU 








Tryptophan 


Tip 


W 


UGG 














Tyrosine 


Tyr 


Y 


UAC 


UAU 











25 The present invention includes a purified nucleic acid molecule complementary, or 

essentially complementary, to the nucleic acid molecule having the sequence set forth in SEQ ID 
NO: 1 or 19. Nucleic acid sequences which are "complementary" are those. which are capable of 
base-pairing according to the standard Watson-Crick complementarity rules. As used herein, the 
term "complementary sequences" means nucleic acid sequences which are substantially 
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complementary, as may be assessed by the same nucleotide comparison set forth above, or as 
defined as being capable of hybridizing to the nucleic acid segment of SEQ ID NO:l or 19 under 
relatively stringent conditions such as those described herein in the detailed description of the 
preferred embodiments. Complementary nucleotide sequences are useful for detection and 
purification of hybridizing nucleic acid molecules. A preferred embodiment of the invention is a 
molecule complementary to SEQ ID NO:l or 19 and is a cDNA molecule complementary to a 
steroidogenic acute regulatory protein mRNA. 

The present inventors also envision the preparation of fusion proteins and peptides, e.g., 
where the StAR coding regions are aligned within the same expression unit with other proteins or 
peptides having desired functions, such as for purification or immunodetection purposes (e.g., 
proteins which may be purified by affinity chromatography and enzyme label coding regions, 
respectively). 

StAR protein has been successfully expressed in eukaryotic expression systems by the 
present inventors, for example, using the PCMV vector in COS cells. Other expression systems 
contemplated by the present inventors include, e.g., baculovirus-based, glutamine synthase- 
based, dihydrofolate reductase-based systems, or the like. For expression in this manner, one 
would position the coding sequences adjacent to and under the control of the promoter. It is 
understood in the art that to bring a coding sequence under the control of such a promoter, one 
positions the 5* end of the transcription initiation site of the transcriptional reading frame of the 
protein between about 1 and about 50 nucleotides "downstream" of (i.e., 3 f of) the chosen 
promoter. 

Where eukaryotic expression is contemplated, one will also typically desire to incorporate 
into the transcriptional unit which includes the StAR gene, an appropriate polyadenylation site 
(e.g., S'-AATAAAO') if one was not contained within the original cloned segment Typically, 
the poly A addition site is placed about 30 to 2000 nucleotides "downstream" of the termination 
site of the protein at a position prior to transcription termination. 
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It is contemplated that virtually any of the commonly employed host cells can be used in 
connection with the expression of StAR in accordance herewith. Examples include cell lines 
typically employed for eukaryotic expression such as COS, CHO, MA-10 cells, or 
Saccharomyces cerevisiae. 

It is proposed that transformation of host cells with DNA segments encoding the StAR 
protein will provide a convenient means for obtaining purified StAR protein. It is also proposed 
that cDNA, genomic sequences, and combinations thereof, are suitable for eukaryotic expression, 
as the host cell will process the genomic transcripts to yield functional mRNA for translation into 
protein. Other embodiments of the invention comprise compositions comprising a purified RNA 
molecule corresponding to StAR. Such embodiments, by way of example, have a nucleotide 
sequence of SEQ ID NO: 14, 15, 16, or 17. 

Nucleic Add Hybridization and PCR reactions. 

Oligonucleotide sequences based on the mouse or a homologous sequence of StAR may 
be used as primers in a polymerase chain reaction to screen for possible mutations in StAR 
mRNA causing a variety of pathologies, for example, the lethal human disease, lipoid congenital 
adrenal hyperplasia. Therefore, StAR nucleic acid sequence can be applied to screen prenatal, 
perinatal, or neonatal DNA for possible mutations in StAR. If the disease is detected early, then 
continual mineralocorticoid, glucocorticoid, or steroid replacement therapy can prolong the life 
of the patient Further applications will arise when additional disease states are linked to 
mutations in the StAR gene, or under conditions where mutations in related genes result in 
decreased levels of StAR mRNA or protein. In these cases, analysis of StAR mRNA or protein 
has significant diagnostic value. 

DNA probes and primers useful in hybridization studies and PCR reactions may be 
derived from any portion of SEQ ID NO: 1 or 1 9 and are generally at least about seventeen 
nucleotides in length. Therefore, probes and primers are specifically contemplated that comprise 
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nucleotides 1 to 17, or 2 to 18, or 3 to 19 and so forth up to a probe comprising the last 17 
nucleotides of the nucleotide sequence of SEQ ID NO:l or 19. Thus, each probe would comprise 
at least about 17 linear nucleotides of the nucleotide sequence of SEQ ID NO:l or 19, designated 
by the formula M n to n + 16," where n is an integer from 1 to about 1435, Longer probes that 
5 hybridize to the StAR gene under low, medium, medium-high and high stringency conditions are 
also contemplated, including those that comprise the entire nucleotide sequence of SEQ ID NO:l 
or 19. Selected oligonucleotide subportions of the gene encoding StAR have significant utility 
irrespective of whether they encode antigenic peptides. In these aspects, it is contemplated, for 
example, that shorter or longer nucleic acid fragments of the StAR gene, prepared synthetically, 
10 recombinantly, or otherwise, can be employed as hybridization probes. Such probes can be 

readily employed in a variety of ways, including their use in the identification of genes encoding 
% StAR in biological tissues or clinical samples, as well as in the detection and evaluation of StAR 
P; in pathologies that relate to cholesterol and/or steroid synthesis. Biological or clinical samples 
01 include, but are not limited to, biopsy specimens from adrenal or gonadal tissue, or blood, for 
|| example. 

G A general method for preparing oligonucleotides of various lengths and sequences is 

%2 described by Caracciolo et al (1989). In general, there are two commonly used solid phase- 
tf based approaches to the synthesis of oligonucleotides containing conventional 5'-3' linkages, one 
it) involving intermediate phosphoramidites and the other involving intermediate phosphonate 
linkages. In the phosphoramidite synthesis a suitably protected nucleotide having a 
cyanoethylphosphoramidate at the position to be coupled is reacted with the free hydroxyl of a 
growing nucleotide chain derivatized to a solid-support The reaction yields a 
cyanoethylphosphite, which linkage must be oxidized to the cyanoethylphosphate at each 
25 intermediate step, since the reduced form is unstable to acid. 

The phosphonate based synthesis is conducted by the reaction of a suitably protected 
nucleotide containing a phosphonate moiety at a position to be coupled with a solid phase- 
derivatized nucleotide chain having a free hydroxyl group, in the presence of a suitable activator 
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to obtain a phosphonate ester linkage, which is stable to acid. Thus, the oxidation to the 
phosphate or thiophosphate can be conducted at any point during synthesis of the oligonucleotide 
or after synthesis of the oligonucleotide is complete. 

The phosphonates can also be converted to phosphoramidate derivatives by reaction with 
a primary or secondary amine in the presence of carbon tetrachloride. To indicate the two 
approaches generically, the incoming nucleotide is regarded as having an "activated" 
phosphite/phosphate group. In addition to employing commonly used solid phase synthesis 
techniques, oligonucleotides may also be synthesized using solution phase methods such as 
triester synthesis. The methods are workable, but in general, less efficient for oligonucleotides of 
any substantial length. 

Preferred oligonucleotides resistant to in vivo hydrolysis may contain a phosphorothioate 
substitution at each base. Oligodeoxynucleotides or their phosphorothioate analogues may be 
synthesized using an Applied Biosystem 380B DNA synthesizer (Applied Biosystems, Inc., 
Foster City, CA). 

A further embodiment of the invention is a purified nucleic acid molecule having at least 
a 17, 20, 25, 30, 50, 100, 200, 500, or 1000 nucleotide sequence that corresponds to, or is capable 
of hybridizing to the nucleic acid sequence of SEQ ID NO:l or 19 under conditions standard for 
hybridization fidelity and stability. Furthermore, it is contemplated that nucleic acid molecules 
having a nucleotide sequence of SEQ IDNOS:l,9, 10, 11, 12, 13, or 19 for stretches of between 
about 10 nucleotides to about 20 or to about 30 nucleotides will find particular utility, with even 
longer sequences, e.g., 40, 50, 150, 250, 450, even up to full length, being more preferred for 
certain embodiments. The ability of such nucleic acid probes to specifically hybridize to StAR 
nucleic acid sequences will enable them to be of use in a variety of embodiments. For example, 
the probes can be used in a variety of assays for detecting the presence of complementary 
sequences in a given sample. However, other uses are envisioned, including the use of the 
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sequence information for the preparation of mutant species primers, or primers for use in 
preparing other genetic constructions. 

These probes will be useful in hybridization embodiments, such as Southern and 
5 Northern blotting. The total size of fragment, as well as the size of the complementary 

stretch(es), will ultimately depend on the intended use or application of the particular nucleic 
acid segment. Smaller fragments will generally find use in hybridization embodiments, wherein 
the length of the complementary region may be varied, such as between about 20 and about 40 
nucleotides, or even up to the full length of the nucleic acid as shown in SEQ ID NOS: 1, 9-13, 
10 or 19 according to the complementary sequences one wishes to detect. 

C The use of a hybridization probe of about 10 nucleotides in length allows the formation of 

P a duplex molecule that is both stable and selective. Molecules having complementary sequences 

over stretches greater than 10 bases in length are preferred, though, in order to increase stability 
|| and selectivity of the hybrid, and thereby improve the quality and degree of specific hybrid 

molecules obtained. One will generally prefer to design nucleic acid molecules having gene- 
r? complementary stretches of 1 5 to 20 nucleotides, or even longer where desired. Such fragments 
J: may be readily prepared by, for example, directly synthesizing the fragment by chemical means, 
D by application of nucleic acid reproduction technology, such as the PCR technology of U.S. 
25 Patent 4,683,202 (herein incorporated by reference) or by introducing selected sequences into 

recombinant vectors for recombinant production. 

Depending on the application envisioned, one will desire to employ varying conditions of 
hybridization to achieve varying degrees of selectivity of probe towards target sequence. For 
25 applications requiring high selectivity, one will typically desire to employ relatively stringent 
conditions to form the hybrids, e.g., one will select relatively low salt and\or high temperature 
conditions, such as provided by 0.02M-0.15M NaCl at temperatures of 50°C to 70°G Such 
selective conditions tolerate little, if any, mismatch between the probe and the template or target 
strand. 
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Where one desires to prepare mutants employing a mutant primer strand hybridized to an 
underlying template or where one seeks to isolate sequences from related species, functional 
equivalents, or the like, less stringent hybridization conditions will typically be needed in order 
to allow formation of the heteroduplex. In these circumstances, one may desire to employ 
conditions such as 0.15M-0.9M salt, at temperatures ranging from 20°C to 55 °C. Cross- 
hybridizing species can thereby be readily identified as positively hybridizing signals with 
respect to control hybridizations. In any case, it is generally appreciated that conditions can be 
rendered more stringent by the addition of increasing amounts of formamide, which serves to 
destabilize the hybrid duplex in the same manner as increased temperature. Thus, hybridization 
conditions can be readily manipulated, and thus will generally be a method of choice depending 
on the desired results. 

In certain embodiments, it will be advantageous to employ nucleic acid sequences of the 
present invention in combination with an appropriate means, such as a label, for determining 
hybridization. A wide variety of appropriate indicator means are known in the art, including 
fluorescent, radioactive, enzymatic or other ligands, such as avidin/biotin, which are. capable of 
giving a detectable signal. In preferred embodiments, one will likely desire to employ a 
fluorescent label or an enzyme tag, such as urease, alkaline phosphatase or peroxidase, instead of 
radioactive or other environmental undesirable reagents. In the case of enzyme tags, colorimetric 
indicator substrates are known which can be employed to provide a means visible to the human 
eye or spectrophotometrically, to identify specific hybridization with complementary nucleic 
acid-containing samples. 

* 

In general, it is envisioned that the hybridization probes described herein will be useful 
both as reagents in solution hybridization as well as in embodiments employing a solid phase. In 
embodiments involving a solid phase, the test DNA (or UNA) is adsorbed or otherwise affixed to 
a selected matrix or surface. This fixed, single-stranded nucleic acid is then subjected to specific 
hybridization with selected probes under desired conditions. The selected conditions will depend 
on the particular circumstances based on the particular criteria required (depending, for example, 
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on the G+C contents, type of target nucleic acid, source of nucleic acid, size of hybridization 
probe, etc). Following washing of the hybridized surface so as to remove nonspecifically bound 
probe molecules, specific hybridization is detected, or even quantified, by means of the label. 

Longer DNA segments will often find particular utility in the recombinant production of 
peptides or proteins. DNA segments which encode peptides from about 15 to about 50 amino 
acids in length, or more preferably, from about 15 to about 30 amino acids in length comprising 
the amino acid sequence of SEQ ID NO:8 are contemplated to be particularly useful. DNA 
segments encoding peptides will generally have a minimum coding length in the order of about 
45 to about 150, or to about 90 nucleotides. DNA segments encoding full length proteins may 
have a minimum coding length in the order of about 2000 nucleotides for a protein or otherwise 
biologically active equivalent peptide having at least a sufficient portion of the sequence in 
accordance with SEQ ID NO:2 or 18 capable of providing said StAR-biological activity. 

The nucleic acid segments of the present invention, regardless of the length of the coding 
sequence itself, may be combined with other DNA sequences, such as promoters, 
polyadenylation signals, additional restriction enzyme sites, multiple cloning sites, other coding 
segments, and the like, such that their overall length may vary considerably. It is contemplated 
that a nucleic acid fragment of almost any length may be employed, with the total length 
preferably being limited by the ease of preparation and use in the intended recombinant DNA 
protocol. For example, nucleic acid fragments may be prepared in accordance with the present 
invention which are up to 10*000 base pairs in length, with segments of 5,000 or 3,000 being 
preferred and segments of about 1,000 base pairs in length being particularly preferred. 

It will be understood that this invention is not limited to the particular nucleic acid and 
amino acid sequences having sequence identifiers as listed in Table 2. Therefore, DNA segments 
prepared in accordance with the present invention may also encode biologically functional 
equivalent proteins or peptides which have variant amino acid sequences. Such sequences may 
arise as a consequence of codon redundancy and functional equivalency which are known to 

-19- 



occur naturally within nucleic acid sequences and the proteins thus encoded. Alternatively, 
functionally equivalent proteins or peptides may be constructed via the application of 
recombinant DNA technology, in which changes in the protein structure may be engineered, 
based on considerations of the properties of the amino acids being exchanged. 

Any of a variety of steroidogenic cells may be used as a source to prepare the purified 
StAR protein of the invention having an amino acid sequence essentially as set forth in SEQ ID 
NO:2 or sequences having substantial identity thereto. Substantial identity as used in the 
definition of the present invention is intended to mean amino acid sequences or nucleic acid 
sequences that have an about 75% to about 99% identical sequence to that of the referenced SEQ 
ID NO. For example, such a relationship exists between SEQ ID NO:2 (amino acid - mouse) and 
(amino acid - human) and SEQ ID NO:l (nucleic acid - mouse) and SEQ ID NO: 19 (nucleic acid 
- human). By way of example, particularly useful cells include adrenal fasciculata, adrenal 
glomerulosa, corpus luteum cells, ovarian theca, ovarian granulosa, mouse Y-l adrenalcortical 
tumor cells, primary Leydig cell cultures and MA-10 Leydig tumor cells. The cell line employed 
to prepare a mitochondrial extract for purposes of isolating the herein described StAR protein 
may comprise Leydig cell cultures and MA-10 Leydig tumor cells. 

Table 2 lists the identity of sequences of the present disclosure having sequence 
identifiers. 
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Table 2 



Identification of Sequences Having Sequence Identifiers 



SEQIDNO: 


IDENTITY 


1 


DNA sequence encoding StAR (mouse) 


2 


Protein sequence of 30kDa StAR (mouse) 


3 


peptide 23 (mouse) 


4 


peptide 25 (mouse) 


5 


peptide 45 (mouse) 


6 


Asn Gin Glu Gly Trp Lys 


7 


Ala Glu His Gly Pro Thr Cys Met Val 


8 


amino acids 88-98 of SEQ ID NO:2 (mouse) 


9 


degenerate oligonucleotides made to peptide 23; coding 
direction (mouse) 


10 


degenerate oligonucleotides made to peptide 23; noncoding 
direction (mouse) 


11 


degenerate oligonucleotides made to peptide 25; coding 
direction (mouse) 


12 


degenerate oligonucleotides made to peptide 25; noncoding 
direction (mouse) 


13 


PCR product, nucleotides 343-743 of SEQ ID NO:l (mouse) 


14 


RNA sequence encoding StAR (mouse) 


15 


RNA sequence encoding StAR peptide (human), (nucleic 
acid position 267-988 coding region) 


16 


RNA sequence encoding StAR peptide (human) (nucleic acid 
position 127-260 coding region). 


17 


RNA sequence for human noncoding StAR region (nucleic 
acid position 1051-1069). 


18 


Amino acid sequence encoding StAR (human) (amino acid 
1-285) 




DNA sequence encoding StAR (human) 
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StAR Protein Compositions 

In particular aspects, the present invention provides a purified StAR protein having an 
amino acid sequence essentially as set forth in SEQ ID NO:2 or 18. In a further embodiment of 
the composition, the amino acid sequence begins at the amino acid methionine at position 48 of 
SEQ ID NO: 2 and extends through amino acid cysteine at position 284 of SEQ ID NO:2. 

The StAR protein may be phosphorylated or unphosphorylated. The mature 30 kDa form 
of StAR protein has four different isoelectric species, designated as 30 kDa 1, 2, 3, and 4, with 1 
being the most basic form and 4 the most acidic form. Studies by the present inventors 
demonstrated that forms 3 and 4 were phosphorylated forms of 1 and 2, and that phosphorylation 
is important for biological activity. These forms of the 30 kDa protein are useful as molecular 
weight standards, and as standards for isoelectric focusing. Threonine, serine, and tyrosine 
amino acids are most frequently those amino acids in a protein that are phosphorylated, and in 
the case of the StAR protein, a threonine may be phosphorylated. 

The purified 37 kDa StAR protein is expected to have many different uses, including, for 
example, supplementing a patient lacking StAR activity to provide proper cholesterol transport 
and subsequent synthesis of steroids. 

In some aspects of the peptides of the StAR protein, the peptides comprise an amino acid 
sequence in accordance with SEQ ID NO:3, 4, 5, 6, 7, or 8. These peptides are useful for 
designing oligonucleotides for screening and for identifying antigenic determinants of the StAR 
protein (see examples). Peptides having less than about 45 amino acid residues may be 
chemically synthesized by the solid phase method of Merrifield (1963), which reference is 
specifically incorporated by reference herein, using an automatic peptide synthesizer with 
standard t-butoxycarbonyl (t-Boc) chemistry that is well known to one skilled in this art in light 
of this disclosure. The amino acid composition of the synthesized peptides may be determined 
by amino acid analysis with an automated amino acid analyzer to confirm that they correspond to 
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the expected compositions. The purity of the peptides may be determined by sequence analysis 
orHPLC 

In still another embodiment of the present invention, methods of preparing a StAR 
5 protein composition are provided. In one aspect, the method comprises growing recombinant 
host cells comprising a vector that encodes a protein which includes an amino acid sequence in 
accordance with SEQ ID NO:2 or 18, or that includes a nucleic acid sequence as defined in SEQ 
ID NO:l or SEQ ID NO; 19, under conditions permitting nucleic acid expression and protein 
production followed by recovering the protein so produced. The host cell, conditions permitting 
1 0 nucleic acid expression, protein production and recovery, will be known to those of skill in the 
art, in light of the present disclosure of the StAR gene. A preferred host cell is a COS cell. 

L Modifications and changes may be made in the sequence of the StAR peptides or protein 

of the present invention and still obtain a peptide or protein having like or otherwise desirable 
1# characteristics. For example, certain amino acids may be substituted for other amino acids in a 
V peptide without appreciable loss of function. Since it is the interactive capacity and nature of an 
CI amino acid sequence that defines the peptide's functional activity, certain amino acid sequences 
^ may be chosen (or, of course, its underlying DNA coding sequence) and nevertheless obtain a 
5 peptide with like properties. It is thus contemplated by the inventors that certain changes may be 
i$ made in the sequence of a StAR peptide or protein (or underlying DNA) without appreciable loss 
of its ability to function. 

Substitution of like amino acids can be made on the basis of hydrophilicity. U.S. Patent 
4,554,101, incorporated herein by reference, states that the following hydrophilicity values have 
25 been assigned to amino acid residues: arginine (+3.0); lysine (+3.0); aspartate (+3.0 ± 1); 

glutamate (+3.0 ± 1); serine (+0.3); asparagine (+0.2); glutamine (+0.2); glycine (0); threonine 
(-0.4); proline (-0.5 ± 1); alanine (-0.5); histidine (-0.5); cysteine (-1.0); methionine (-1.3); valine 
(-1.5); leucine (-1.8); isoleucine (-1.8); tyrosine (-2,3); phenylalanine (-2.5); tryptophan (-3,4). It 
is understood that an amino acid can be substituted for another having a similar hydrophilicity 
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value and still obtain a biologically equivalent peptide. In such changes, the substitution of 
amino acids whose hydrophilicity values are within ±2 is preferred, those which are within ±1 
are more preferred, and those within ±0.5 are most preferred. 

As outlined above, amino acid substitutions are generally therefore based on the relative 
similarity of the amino acid side-chain substituents, for example, their hydrophobicity, 
hydrophilicity, charge, size, and the like. Exemplary substitutions which take various of the 
foregoing characteristics into consideration are well known to those of skill in the art and 
include: arginine and lysine; glutamate and aspartate; serine and threonine; glutamine and 
asparagine; and valine, leucine and isoleucine. 

Two designations for amino acids are used interchangeably throughout this application, 
as is common practice in the art. Alanine = Ala (A); Arginine = Arg (R); Aspartate = Asp (D); 
Asparagine = Asn (N); Cysteine » Cys (C); Glutamate = Glu (E); Glutamine = Gin (Q); Glycine 
= Gly (G); Histidine = His (H); Isoleucine = lie (I); Leucine = Leu (L); Lysine « Lys (K); 
Methionine = Met (M); Phenylalanine = Phe (F); Proline^ Pro (P); Serine = Ser (S); Threonine^ 
Thr (T); Tryptophan - Trp (W); Tyrosine = Tyr (Y); Valine= Val (V). 

While discussion has focused on functionally equivalent polypeptides arising from amino 
acid changes, it will be appreciated that these changes may be effected by alteration of the 
encoding DNA, taking into consideration also that the genetic code is degenerate and that two or 
more codons may code for the same amino acid. 

Another aspect of the invention is a method of preparing a recombinant steroidogenic 
acute regulatory protein encoded by the purified nucleic acid molecule having a nucleotide 
sequence of SEQ ID NO: 1 or 19, the method comprising the steps of preparing a recombinant 
host bearing the nucleic acid molecule, the host being capable of expressing the protein, culturing 
the recombinant host to produce steroidogenic acute regulatory protein, and collecting the 
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recombinant steroidogenic acute regulatory protein having an amino acid sequence essentially as 
set forth in SEQ ID NO:2 or 18. In one aspect, the recombinant host is a COS cell. 

A further embodiment of the present invention relates to a purified nucleic acid molecule 
5 encoding StAR protein having an amino acid sequence essentially as set forth in SEQ ID NO: 2, 
said nucleic acid molecule obtained by a process of; i) preparing oligonucleotides that encode a 
segment of an amino acid sequence of SEQ ID NO:2 and that have at least about 17 nucleotides; 
ii) screening an animal cell DNA library with said oligonucleotides; and iii) obtaining the 
purified nucleic acid molecule encoding StAR protein having an amino acid sequence essentially 
10 as set forth in SEQ ID NO: 2 or 18. 

g Pharmaceutical Preparations. 

M Another aspect of the present invention provides therapeutic agents for the treatment of 

m steroid hormone-dependent disorders in an animal. The therapeutic agent comprises an 

rl admixture of StAR peptide or protein in a pharmaceutical^ acceptable excipient. Most 

* preferably, the therapeutic agent will be formulated so as to be suitable for injection. 

3 Pharmacologically active peptides of StAR may also be provided to a subject via gene therapy. 

U Many different vehicles exist for accomplishing this end, such as incorporation of the StAR gene, 

O or fragment thereof, into an adenovirus, retrovirus, or other techniques known to those of skill in 

20 the art in light of the present disclosure. Ex vivo gene therapy is also contemplated as another 
mode of administration. 

Such preparations should contain at least 0.1% of active compound. The percentage of 
the compositions and preparations may, of course, be varied and may conveniently be between 
25 about 2 to about 60% of the weight of the unit. The amount of active compounds in such 
therapeutically useful compositions is such that a suitable dosage will be obtained. 

The active compounds may be administered parenterally or intraperitoneally. Solutions 
of the active compounds as free base or pharmacologically acceptable salts can be prepared in 

-25- 



water suitably mixed with a surfactant, such as hydroxypropylcellulose. Dispersions can also be 
prepared in glycerol, liquid polyethylene glycols, and mixtures thereof and in oils. Under 
ordinary conditions of storage and use, these preparations contain a preservative to prevent the 
growth of microorganisms. 

5 

The pharmaceutical forms suitable for injectable use include sterile aqueous solutions or 
dispersions and sterile powders for the extemporaneous preparation of sterile injectable solutions 
or dispersions. In all cases the form must be sterile and must be fluid to the extent that easy 
syringability exists. It must be stable under the conditions of manufacture and storage and must 
1 0 be preserved against the contaminating action of microorganisms, such as bacteria and fungi. 

The carrier can be a solvent or dispersion medium containing, for example, water, ethanol, polyol 
O (for example, glycerol, propylene glycol, and liquid polyethylene glycol, and the like), suitable 
jM mixtures thereof and vegetable oils. The proper fluidity can be maintained, for example, by the 
L~ use of a coating, such as lecithin, by the maintenance of the required particle size in the case of 
t§ dispersion and by the use of surfactants. The prevention of the action of microorganisms can be 
k brought about by various antibacterial ad antifungal agents, for example, parabens, 
C] chlorobutanol, phenol, sorbic acid, thimerosal, and the like. In many cases, it will be preferable 
J? to include isotonic agents, for example, sugars or sodium chloride. Prolonged absorption of the 
O injectable compositions can be brought about by the use in the compositions of agents delaying 
f5 absorption, for example, aluminum monostearate and gelatin. 

Sterile injectable solutions are prepared by incorporating the active compounds in the 
required amount in the appropriate solvent with various of the other ingredients enumerated 
above, as required, followed by filtered sterilization. Generally, dispersions are prepared by 
25 incorporating the various sterilized active ingredients into a sterile vehicle which contains the 

basic dispersion medium and the required other ingredients from those enumerated above. In the 
case of sterile powders for the preparation of sterile injectable solutions, the preferred methods of 
preparation are vacuum-drying and freeze-drying techniques which yield a powder of the active 
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ingredient plus any additional desired ingredient from a previously sterile-filtered solution 
thereof. 

As used herein, "pharmaceutical^ acceptable carrier ,, includes any and all solvents, 
5 dispersion media, coatings, antibacterial and antifungal agents, isotonic and absorption delaying 
agents and the like. The use of such media and agents for pharmaceutical active substances is 
well known in the art. Except insofar as any conventional media or agent is incompatible with 
the active ingredient, its use in the therapeutic compositions is contemplated. Supplementary 
active ingredients can also be incorporated into the compositions. See, for example, Remington 

1 0 (1 995), which reference is incorporated by reference herein. 

yg Antibodies. 

rj In another aspect, the present invention includes an antibody that is immunoreactive with 

11 a StAR polypeptide as described for the invention. An antibody can be a polyclonal or a 
S monoclonal antibody. In some embodiments, the antibody is a monoclonal antibody. In 

T addition, the antibodies may comprise recombinant antibodies and may be obtained employing 
H the information provided here on StAR in conjunction with those techniques well know to those 
of skill in the art. These antibodies may further be described as chimeric recombinant antibodies, 
r\ particularly humanized chimeric antibodies specific for StAR. Means for preparing and 
% characterizing antibodies are well known in the art (See, e.g., Antibodies "A Laboratory Manual, 
E. Howell and D. Lane, Cold Spring Harbor Laboratory, 1988). A preferred polyclonal antibody 
has binding specificity for amino acids 1-26, 10-26, 36-47, or 88-98 of SEQ ID NO:2, 

In addition, the antibodies may comprise recombinant antibodies, and may be obtained 
25 employing the information provided here on StAR in conjunction with those techniques well 
known to those of skill in the art. These antibodies may further be described as chimeric 
recombinant antibodies, particularly humanized chimeric antibodies specific for StAR. See, for 
example, Munro (1984) Nature 312:597; U.S. Patent No. 5,225,599. 
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Briefly, a polyclonal antibody is prepared by immunizing an animal with an immunogen 
comprising a polypeptide of the present invention and collecting antisera from that immunized 
animal. A wide range of animal species can be used for the production of antisera. Typically an 
animal used for production of anti-antisera is a rabbit, a mouse, a rat, a hamster or a guinea pig. 
5 Because of the relatively large blood volume of rabbits, a rabbit is a preferred choice for 
production of polyclonal antibodies. 

Antibodies, both polyclonal and monoclonal, specific for the peptides of the present 
invention may be prepared using conventional immunization techniques, as will be generally 

10 known to those of skill in the art. A composition containing antigenic epitopes of the peptide 
sequences, isolated peptides, or fragments thereof can be used to immunize one or more 

5 experimental animals, such as a rabbit or mouse, which will then proceed to produce specific . 

tl antibodies against StAR. Polyclonal antisera may be obtained, after allowing time for antibody 

2f generation, simply by bleeding the animal and preparing serum samples from the whole blood, 

ii 

; In one embodiment, the polyclonal antibodies to StAR were prepared by injecting a 

J] rabbit with a StAR peptide 88-98 (SEQ ID NO:8) conjugated to KLH (keyhole limpet 

iM hemocyanin) mixed with a StAR peptide 88-98 modified according to the MAP procedure (with 

5 a branched lysine core) (See Tam et al (1989) J. of Immun Methods, 123:53-61) incorporated 

20r herein by reference. Prior attempts to raise antibody using as immunogen the mouse StAR 
peptide, 97-107 was less successful generating antibody. 

To obtain monoclonal antibodies, one would also initially immunize an experimental 
animal, often preferably a mouse, with a purified peptide composition. One would then, after a 
25 period of time sufficient to allow antibody generation, obtain a population of spleen or lymph 

cells from the animal. The spleen or lymph cells can then be fused with cell lines, such as human 
or mouse myeloma strains, to produce antibody-secreting hybridomas. These hybridomas may 
be isolated to obtain individual clones which can then be screened for production of antibody to 
the desired peptide. 
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Following immunization, spleen cells are removed and fused, using a standard fusion 
protocol (see, e.g., The Cold Spring Harbor Manual for Hybridoma Development, incorporated 
herein by reference) with plasmacytoma cells to produce hybridomas secreting monoclonal 
antibodies against the desired peptide. Hybridomas which produce monoclonal antibodies to the 
selected antigens are identified using standard techniques, such as ELIS A and Western blot 
methods. 

Hybridoma clones can then be cultured in liquid media and the culture supernatants 
purified to provide the peptide-specific monoclonal antibodies. In general, monoclonal 
antibodies to the peptide antigen can be used in the identification of steroid hormone-dependent 
disorders. It is proposed that the monoclonal antibodies of the present invention will find useful 
application in standard immunochemical procedures, such as ELIS A and Western blot methods, 
as well as other procedures which may utilize antibody specific to common or allelically distinct 
peptide epitopes. 

Monoclonal and polyclonal antibodies raised against peptides or protein of the present 
examples are useful for (1) screening a cDNA expression library in the process of cloning a gene 
that encodes a particular protein or related protein (for example, the SUPERSCREEN® 
immunoscreening system from AMERSHAM®), (2) facilitating the purification of a particular 
protein or related protein by using column chromatography to which the monoclonal antibody is 
bound, and (3) providing reagents necessary for a diagnostic immunoassay for screening 
biological samples. 

Monoclonal antibodies are obtained using the following procedure: 

Immunization Schedule for Raising Monoclonal Antibodies 

1 . For each mouse, mix 250 (A of antigen solution containing 10 \ig of antigen with 250 yX 
of complete Freund's adjuvant. Inject six BALB/c female mice ip (intraperitoneal 
injection). 
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2. After 14 days, repeat the injections of antigen and incomplete Freund's adjuvant. 

3. Collect tail bleeds from immunized mice on day 24. Do 1 in 5 dilutions in phosphate 
buffered saline (PBS) and test all samples by comparison with similar dilutions of normal 
mouse serum in a dot blot. 

4. On day 35, inject all animals ip with antigen and incomplete Freund's. 

5. Day 45, do tail bleeds and test by dot blot. All serum samples checked by 
immunoprecipitation against in vivo radiolabeled antigen preparation. 

6. Day 56, inject best responder, 100 (A iv and 100 iA ip. All others get ip injection with 
incomplete Freund's. 

7. Day 59, fuse splenocytes from best responder. 

In still another embodiment of the invention, a hybridoma cell line which produces a 
monoclonal antibody which specifically binds StAR protein is provided. Most particularly, the 
hybridoma cell line is an animal hybridoma cell line prepared by a process of immunizing an 
animal, such as a mouse or a rat, with StAR protein, isolating anti-StAR antibody producing cells 
from the immunized animal, and fusing the antibody producing cells with a neo-plastic animal 
cell line to obtain a hybridoma cell line. The resultant hybridoma tissue culture supernatants are 
screened for monoclonal antibodies as follows: 



1 . A protein solution of at least 1 pg/ml of antigen is added to a nitrocellulose sheet at 0. 1 
ml/cm 2 . Allow the protein to bind to the paper for 1 hr. Higher concentrations of 
proteins will increase the signal and make screening faster and easier. If the amount of 
protein is not limiting, concentrations of 10-50 t*g/ml should be used. Nitrocellulose can 
bind approximately 100 fxg of protein per cm 2 . 

2. Wash the nitrocellulose sheet three times in PBS. 

3. Place the sheet in a solution of 3% BSA in PBS with 0.02% sodium azide for 2 hr to 
overnight To store the sheet, wash twice in PBS and place at 4°C with 0.02% sodium 
azide. For long-term storage, shake off excessive moisture from the sheet, cover in 
plastic wrap, and store at -70°C. 
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4. Place the wet sheet on a piece of parafilm, and rule with a soft lead pencil in 3-mm 
squares. Cut off enough paper for the number of assays. 

5. Apply 1 /A of the hybridoma tissue culture supernatant to each square. Incubate the 
nitrocellulose sheet on the parafilm at room temperature in a humid atmosphere for 30 
min. 

Along with dilutions of normal mouse serum, include dilutions of the mouse serum from 
the last test bleed as controls. Dilutions of the test sera are essential to control correctly 
for the strength of the positive signals. Mouse sera will often contain numerous 
antibodies to different regions of the antigen and therefore will give a stronger signal than 
a monoclonal antibody. Therefore, dilutions need to be used to lower the signal. Good 
monoclonal antibodies will appear 10-fold less potent than good polyclonal sera. 

6. Quickly wash the sheet three times with PBS, then wash two times for 5 min each with 
PBS. 

7. Add 50,000 cpm of l25 I-labeied rabbit anti-mouse immunoglobulin per 3-mm square in 
3% BSA/PBS with 0.02% sodium azide (about 2.0 ml/cm 2 ). 

8. After 30-60 min of incubation with shaking at room temperature, wash extensively with 
PBS until counts in the wash buffer approach background levels. 

9. Cover in plastic wrap and expose to X-ray film with a screen at -70°C. 

The hybridoma identified as producing antibody to the protein of interest is passaged as 
follows: 



1 . Inject 1 0 7 (or less) cells into female mice that have been injected ip about 1 week earlier 
with 0,5 mi of pristane or incomplete Freund's adjuvant These types of injections are 
also used to prime mice for ascites production, and this may serve as a convenient source 
of appropriate hosts. If no mice are available, inject mice with incomplete Freund's 
adjuvant and wait 4 hr to 1 day before injecting the hybridoma cells. The animals must 
be of the same genetic background as the cell line. 

2. If an ascites develops, tap the fluid and transfer into a sterile centrifuge tube. 

3 . Spin the ascites at 400g for 5 min at room temperature. 
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4. Remove the supernatant, Resuspend the cell pellet in 10 ml of medium supplemented 
with 10% fetal bovine serum and transfer to a tissue culture plate. The supernatant can be 
checked for the presence of the antibody and used for further work if needed. 

5. Handle as for normal hybridomas, except keep the cells separate from the other cultures 
until there is little chance of the contamination reappearing. 

The present invention in still another aspect defines an immunoassay for the detection of 
a StAR protein in a biological sample. In one particular embodiment of the immunoassay, the 
immunoassay comprises; preparing an antibody having binding specificity for StAR protein to 
provide an anti-StAR antibody, incubating the anti-StAR antibody with the biological sample for 
a sufficient time to permit binding between antibody and StAR present in said biological sample, 
and determining the presence of bound antibody by contacting the incubate of the sample and 
antibody with a detectably labeled antibody specific for the anti-StAR antibody, wherein the 
presence of anti-StAR antibody in the biological sample is detectable as the measure of the 
detectably labeled antibody from the biological sample. In some embodiments, the antibody is 
preferably a monoclonal antibody having binding specificity for the StAR amino acid sequence 
88-98 of SEQ ID NO:2 or a region of identical amino acid sequence in SEQ ID NO. 18 (See 
FIG. 5). 

By way of example, the antibody may be labeled with any of a variety of detectable 
molecular labeling tags. Such include, an enzyme-linked antibody, a fluorescent-tagged 
antibody, or a radio-labelled antibody. 

A further embodiment of the invention is a method for detecting a chromosomal genetic 
lesion comprising the steps of i) preparing a nucleic acid probe having a nucleotide sequence that 
includes at least a 17-base segment of SEQ ID NO: 1 or 19 corresponding to a region of a StAR 
genetic lesion in a diseased patient sample; corresponding to a region of a StAR genetic lesion in 
a diseased patient sample; ii) contacting a chromosomal sample with the probe to allow 
hybridization of the sample to the probe under conditions standard for hybridization fidelity and 



-32- 



stability, wherein lack of specific hybridization of the probe and the chromosomal sample 
provides for detection of a potential genetic lesion in the chromosome. The genetic lesion may 
be a deletion, a rearrangement, an insertion, a transition, a transversion, a frameshift, a missense 
or a nonsense mutation. In particular, the genetic lesion correlates with the presence of lipoid 
5 congenital adrenal hyperplasia. Human tissue samples may be biopsy material from adrenal 
tissue, gonadal tissue, or blood* 

In another aspect of the invention, a screening method for lipoid congenital adrenal 
hyperplasia is provided. The method comprises the steps of i) obtaining a chromosomal sample 
|3 to provide a test sample; ii) preparing a nucleic acid probe having a nucleotide sequence 
'% essentially as set forth in SEQ ID NO: 1 or 1 9; and, iii) contacting the test sample with the nucleic 
^ acid probe under hybridization conditions allowing for detection of a mismatch in a hybridizing 
ffi molecule as a screening method for lipoid congenital adrenal hyperplasia, A mismatch is 
% determined most readily by determining the nucleotide sequence of the hybridizing molecule, a 
15 difference in the nucleotide sequence of the hybridizing molecule and the nucleotide sequence of 
%l SEQ ID NO: 1 or 19 provides a screen for lipoid congenital adrenal hyperplasia. 

2 Further embodiments of the invention include; a method for stimulating cholesterol 

transport, a method for increasing production of progesterone, and a method for increasing 

20 steroidogenesis; these methods comprise administering a pharmacologically effective amount of 
steroidogenic acute regulatory protein having an amino acid sequence essentially as set forth in 
SEQ ID NO:2 or 18. Progesterone is used clinically in a variety of applications in males and 
females. Methods for providing enhanced production of progesterone are thus a valuable 
application of the StAR compositions of the present invention. The protein may be delivered by 

25 recombinant means, i.e., synthesis from an expression vector containing nucleic acid sequences 
encoding the protein. 

Following long-standing patent law convention, the terms "a" and "an" mean "one or 
more" when used in this application, including the claims. 
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Abbreviations 

Bt 2 cAMP : N 6 , 2 , -0-dibutyryladenosine-3 , :5 , -cyclic 
monophosphate 

CHAPS : 3-[3-cholamidopropyl dimethylammonio] 1 

propanesulfonate 
CSCC : cholesterol side-chain cleavage 

DBI diazepam binding inhibitor 

hCG : human chorionic gonadotropin 

HPLC : high performance liquid chromatography 

IOD : integrated optical density 

LH : luteinizing hormone 

< 

Mops : 3-[N-Morpholino]propane-sulfonic acid 

PAGE : polyacrylamide gel electrophoresis 

PBR peripheral benzodiazepine receptor 

PBS : Dulbecco's phosphate-buffered saline with 

calcium and magnesium 
PCR polymerase chain reaction 

SAP : steroidogenesis activator polypeptide 

SCP 2 : sterol carrier protein 2 

WAY : Waymouth's MB 752 media 



BRIEF DESCRIPTION OF THE DRAWINGS 

The following drawings form part of the present specification and are included to further 
demonstrate certain aspects of the present invention. The invention may be better understood by 
reference to one or more of these drawings in combination with the detailed description of 
specific embodiments presented herein. 
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FIG. 1 . Restriction map of the isolated cDNA clone encoding the 37 kDa precursor protein. 
The initiating ATG codon and the termination codon, TAA, are indicated by vertical bars along 
with several unique endonuclease restriction sites. The arrowheads indicate the position of the 
degenerate oligonucleotides designed from peptides 25 and 23, and the hatched bar denotes the 
5 region amplified by PCR using these oligonucleotides. The PCR amplified product was used to 
screen the cDNA library. Both the coding and noncoding strand of the cDNA were sequenced. 
The solid lines shown beneath the map represent the overlapping fragments of the coding strand 
which were sequenced while the dashed lines represent the noncoding strand fragments which 
were sequenced by the dideoxynucleotide sequencing method of Sanger et al (1977) using the 
1 0 Sequenase Version 2 kit (United States Biochemical). 

?; FIG. 2. Nucleotide sequence of the 37 kDa cDNA clone (SEQ ID NO:l) and the deduced amino 
HI acid sequence for the protein (SEQ ID NO:2). The nucleotides are numbered to the left with 
5 position 1 as the first nucleotide in the cDNA and the amino acids are numbered to the right with 
f 5 amino acid 1 being the initiating methionine of the mitochondrial precursor. Capital letters are 
O used for the coding region of the cDNA and to denote the in-frame stop codon in the 5' 
M untranslated region and the polyadenylation signal and poly (A) + tail in the 3' untranslated region. 
% The amino acids corresponding to peptides 23, 25, and 45 are underlined and the corresponding 
O peptide is indicated below the line. The underlined nucleotide sequences were included in the set 
20 of degenerate oligonucleotides designed from the amino acid sequence. The antibody was raised 
against amino acids 88-98 which includes most of peptide 25. The mature protein begins with 
the methionine at position 48 (see Example 2 regarding cleavage of the precursor protein). 

FIG. 3 . Nucleic acid sequence of human StAR and mouse StAR. Segments of high 
25 identity of nucleic acid sequence are shown. 

FIG. 4. Effect of KN93 on agonist-stimulated aldosterone production and StAR protein 
expression in H295R cells. Cells were incubated for six hours with Ang II (lOOnM), K+(16 
|iM), and Bay K(l^iM) in the presence or absence of KN93 (3*iM). The medium aldosterone 
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content was determined by R.L A. and normalized to the tissue culture well protein content Data 
points represent the mean ±SE of values from six separate culture wells expressed as fold 
increase over basal. Significant inhibition relative to the control response is indicated at 
*p<0.01 . StAR protein was examined using equivalent amounts of total cellular lysate (30|ig) by 
immunoblot analysis as described in the Methods. Results are representative of those obtained 
from three separate experiments. 

FIG. 5. Comparison of the deduced amino acid sequences from known nucleotide 

sequences of the putative coding region of StAR protein from mouse, human, bovine and ovine. 
Only a partial sequence is presented for ovine- Putative conserved phosphorylation sites that 
correspond to consensus motifs are highlighted for PKA and CAM kinase II (A), PKC (C), CKII 
(CK), and P34CDC2 kinase (CD). The proposed cleavage sites for the mitochondrial proteases, 
matrix-processing protease, and mitochondrial intermediate-processing peptide are noted (CI and 
C2); however, only the mouse sequence strictly follows the consensus motif for two cleavages 
with the other sequences predicting one cleavage. Another putative PKA phosphorylation site is 
found in bovine and human sequences at position 277. While Ser 277 is not conserved in the 
murine sequence, there is a Ser at 278. If phosphorylation is crucial to the function of StAR 
protein and this Ser is the residue that is phosphorylated, then this may explain how the C- 
terminal deletion mutants found in lipoid CH patients affect StAR function. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The present inventors have utilized the MA- 10 mouse Leydig tumor cell line to study the 
acute regulation of steroidogenesis. Specifically, the inventors have identified a family of 
hormone-induced mitochondrial proteins in MA- 10 cells that regulate cholesterol delivery to the 
inner mitochondrial membrane and the CSCC. These proteins have been described as the 
mitochondrial 37kDa, 32kDa, and 30kDa molecular weight proteins and they are synthesized in 
response to either LH and HCG or by stimulation with the CAMP analogue, Bt 2 cAMP. The 
30kDa species consists of four separate proteins and proteolytic digestion of all four forms 
indicates that they are all modified forms of the same protein. Pulse chase experiments and 
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tryptic peptide mapping of the 37kDa and 30kDa proteins indicated that the 37kDa fonn is a 
precursor to the 30kDa protein. 

The following data support the involvement of these proteins in the acute regulation of 
5 steroidogenesis: i) their synthesis is directly correlated to the capacity of the MA- 10 cells to 
produce steroid in response to hormone stimulation in both a time and dose responsive manner; 
ii) their synthesis is sensitive to cycloheximide; iii) the 30kDa proteins are localized to the 
mitochondria and are processed from a larger precursor protein of 37kDa; iv) a rat Leydig tumor 
cell line (R 2 C) which constitutively produces steroids constitutively expresses the 30kDa 

14 proteins; and v) inhibition of steroidogenesis is concomitant with inhibition of synthesis of 
tff 30kDa proteins (Stocco and Kilgore, 1988; Stocco and Chaudhary, 1990; Stocco and Chen, 

u 1991; Stocco and Sodeman, 1991; Stocco, 1992; Stocco and Ascoli, 1993; Stocco et a/., 1993; 
% Stocco and Clark, 1993). 

15 The present inventors have now cloned the cDNA for this family of proteins, and have 
Ci also more directly determined their function in the regulation of steroid production, particularly 

in MA- 10 cells. 

The following examples describe the purification of the MA- 10 30kDa proteins and the 
20 isolation and characterization of a full length cDNA clone. The cDNA encodes a novel mouse 
protein of 3 1 .6 kDa, which relates to the previously described 37kDa precursor protein of the 
LH-induced family of mitochondrial proteins. The amino acid sequence at the amino terminus 
has been identified by the present inventors to be characteristic of a mitochondrial targeting 
signal. Using an in vitro transcription/translation system, the precursor protein was processed 
25 and modified to all forms of the 30kDa proteins by isolated mitochondria. Immunobiot analysis 
of mitochondria isolated from either Bt 2 c AMP-stimulated MA- 1 0 cells or MA- 1 0 cells 
transfected with the cDNA confirmed the cDNA encodes the same immunospecific 30kDa 
protein. In addition, in the absence of hormone stimulation, expression of the 30kDa protein in 
MA- 10 cells resulted in a 1.5-3.5 increase in progesterone production above cells transfected 
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with vector alone, thus, the present inventors have demonstrated for the first time that 
expression of the LH-inducibie 30kDa protein directly results in an increase in steroid 
biosynthesis. This protein is required in the acute regulation of hormone-induced 
steroidogenesis. 

Materials & Methods. The materials and methods used in the following examples are 
provided here. In light of these teachings, one of skill in the art would realize that other 
equivalent materials and methods may be used in the present invention. 

Chemicals. Waymouth's MB 752 medium, Dulbecco's modified Eagle's medium, horse 
serum, fetal bovine serum, antibiotics, and PBS were purchased from Life Technologies, Inc. 
(Gaithersburg, MD). Bt 2 cAMP, leupeptin, aprotinin, phenylmethylsufonyl fluoride, 
formaldehyde, and Escherichia coli alkaline phosphatase were obtained from Sigma (St. Louis, 
MO). Silver nitrate was from Fisher (Houston, TX). The ampholines and stock solutions of 
nucleic acids were- purchased from Pharmacia Biotech Inc. (Piscataway, NJ). Restriction 
endonucleases, T7 RNA polymerase, RNAsin, and Taq DNA polymerase were purchased from 
Promega (Madison, WI). Radiolabeled nucleotides [ 32 -P] CTP and [ 35 S]-methionine were 
obtained from Du Pont NEN (Boston, MA). Oligonucleotides were synthesized and purified by 
Midland Certified Reagent Co. (Midland, TX). 

Maintenance of MA- 10 and COS 1 cells . The MA-10 mouse Leydig tumor cell line was 
from Dr. M. Ascoli (Dept of Pharmacology, Univ. of Iowa College of Medicine, Iowa City, 
Iowa). These cells were derived from the M5480P tumor, they have functional LH/CG receptors 
and produce large amounts of progesterone rather than testosterone in response to hormone 
stimulation. The cells were grown in Waymouth's MB/752 media containing 15% horse serum 
(WAY*) at 37°C in a humid atmosphere under 5% C0 2 and maintained in culture using standard 
techniques (Ascoli, 1981). COS 1 cells were obtained from the American Type Culture 
Collection (#CRL 1650) and grown in Dulbecco's modified Eagle's medium high glucose media 

-38- 



containing 10% fetal bovine serum and 100 units of penicillin/ml and 10 units of streptomycin 
sulfate/ml. 

Isolation of Mitochondria. MA- 10 cells were stimulated for 6 h with ImM Bt 2 cAMP in 
Waymouth's media containing 5% horse serum, then washed once with PBS (Life Technologies, 
Inc.) and collected in 0.25 M sucrose, 10 mM Tris, pH 7.4, 0.1 mM EDTA by scraping with a 
rubber policeman. The cells were lysed by homogenization at 1000 rpm for 25 passes with a 
Potter Elvehjem homogenizer fitted with a serrated Teflon pestle. The homogenate was 
centrifuged at 600 x g for 30 min, and the resultant supernatant was centrifuged at 9000 x g for 
30 min to pellet the mitochondria. The mitochondrial pellets were stored frozen at -80°C until 
used to purify the 30-kDa proteins. 

For the in vitro translation reactions, mitochondria were isolated as above with the 
following exceptions; MA- 10 cells were not stimulated with hormone, and the cells were lysed 
using a glass-on-glass Dounce homogenizer with fitted pestle and homogenized by hand for 25 
passes. The mitochondrial pellet was washed once with import buffer (3% bovine serum 
albumin, 70mM KC1, 220 mM sucrose, 10 mM Mops/KOH (pH 7.2), 2.5 mM MgCl 2 (Hartl, 
1986) then resuspended in 200 fA of the import buffer to a protein concentration of 7.5 mg/ml. 
Mitochondria were used immediately after isolation for the in vitro translation reaction. Isolation 
of mitochondria from COS 1 cells for immunoblot analysis was as described (Clark and 
Waterman, 1991). 

Purification of the 30-kDa Proteins from Isolated Mitochondria In general, mitoplasts 
(mitochondria stripped of outer membrane) were purified from isolated mitochondria and 
solubilized with CHAPS detergent Preparative one-dimensional SDS-PAGE gels were used to 
isolate CHAPS soluble proteins of 28-32 kDa in size (30-kDa fraction), and the 30-kDa proteins 
were isolated and recovered by two-dimensional SDS-PAGE of the 30-kDa fraction. A more 
detailed description of this purification is as follows. 
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Preparation of Mitoplasts and CHAPS Solubilization Mitoplasts were prepared using 
the methods detailed by Ardail et ai (1991). Mitochondria were resuspended in swelling buffer 
(10 mM potassium phosphate, pH 7.4, 0.1 mM EDTA, 1 mM phenylmethylsulfonyi fluoride, 0.1 
mg leupeptin/ml, and 0.04 units aprotinin/ml) to a protein concentration of 2.5 mg/ml, incubated 
5 on ice for 20 min, then homogenized 5 times by hand using a glass Dounce homogenizer fitted 
with a Teflon pestle. An equal volume of swelling buffer containing 30% sucrose was added to 
the shocked mitochondria and mixed thoroughly. Mitoplasts were recovered by centrifugation at 
12,000 x g for 30 min, resuspended in swelling buffer to a protein concentration of 2.5 mg/ml, 
and solubilized by adding a fresh solution of swelling buffer containing 25% CHAPS to achieve 
10 a 1 : 1 mg protein to mg detergent ratio (0.25% CHAPS final concentration). The CHAPS soluble 
'% sample was recovered after a 1 00,000 x g centrifugation for 45 min and concentrated under 
51 nitrogen pressure using a Filtron stir-cell with 1 0,000 molecular weight cut off (Pharmacia 
m Biotech Inc.). Protein was determined for each fraction by the method of Bradford (1 976). 

f§ Preparative One-dimensional SDS-Polyacrylamide Gel Electrophoresis. The CHAPS 

O soluble sample was separated on a preparative 1.5 mm 12.5% SDS-polyacryiamide gel 

hL (Laemmli, 1 970). A 5 mm section of the polyacrylamide gel corresponding to 28-32kDa band 

^ was excised, and the proteins were electroeluted from the gel and concentrated using the 

O Centrilutor microelectroeluter system (Amicon). The position of the 5 mm strip was determined 

20 by running a reference lane containing molecular weight markers and cutting the lane from the 
gel and staining the markers with Coomassie blue. 

Two-Dimensional Polyacrylamide Gel Electrophoresis. Approximately 150-250 jx% of 
the 30-kDa fraction (the concentrated 28-32kDa proteins from the one-dimensional gel) was 
25 resolved by two-dimensional PAGE (O'Farrell, 1975), and the proteins were electrophoretically 
transferred to nitrocellulose in 20 mM Tris/Cl pH 7.4, 150 mM glycine, 10% p-mercaptoethanol, 
0.01% SDS for 4 h at 350 mA (Deutscher, 1990; Towbin, et al 1979). The nitrocellulose was 
transiently stained with Ponseau S (0.2% in 1% acetic acid) to visualize and isolate the specific 
30 kDa proteins for subsequent amino acid microsequence analysis. The filters were destained 
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with 1% acetic acid, and washed thoroughly with HPLC grade water. The nitrocellulose spots 
were stored damp at -80°C until shipped to the Harvard Microchemical facility where the in situ 
digestion, peptide separation, and microsequence analysis was performed on a fee-for service 
basis. 

Quantitation of Silver Stained Proteins. The two-dimensional SDS-PAGE gels were 
fixed in 50% methanol, 12% acetic acid for 1 h, then washed with 50% EtOH 3 times for 20 min 
with each wash. To silver stain the proteins, the gel was pretreated with 0.02% sodium 
thiosulfate for 1 min, rinsed 3 times with H 2 0 for 20 s each rinse, then treated with 0.2% silver 
titrate, 0.02% formaldehyde for 20 min. After the silver nitrate impregnation, the gel was rinsed 
with H 2 0 twice and developed with 6% sodium carbonate, 0.02% formaldehyde. The silver- 
stained image was captured, and the integrated optical densities (IOD) of the proteins were 
quantitated using the Biolmage Visage 2000 computer-assisted image analysis system 
(Biolmage, Ann Arbor, MI). The percent of the total IOD of each spot (protein) was determined 
automatically and used to quantitate the 30-kDa protein. For example, the percent of total IOD 
for 30-kDa protein 1 x mg total protein loaded onto the first dimension gel/mg total protein for 
the fraction - mg 30-kDa protein 1 in that fraction. 

Preparation of the Bt£ AMP -stimulated MA- 10 Mouse Leydig Tumor Cell cDNA library. 
Total RNA was isolated from 6-h Bt 2 cAMP-stimulated MA- 10 cells by a one-step extraction 
adapted from the methods of Chomczynski and Sacchi (1987) using RNA STAT-60 (Tel-Test B, 
Inc., Friendswood, TX), Poly A + mRNA was twice selected on a gravity flow oligo-dT column 
(5 Prime-3 Prime, Inc.). A Xgt22A cDNA library was constructed from the polyA* RNA using 
the Superscript Lambda System for cDNA synthesis and X cloning (Life Technologies, Inc.). 
Briefly, first strand synthesis was generated using a Notl Primer- Adapter and Superscript RT, an 
engineered Moloney murine leukemia virus reverse transcriptase. Second strand synthesis was 
generated by nick translation replacement of the template mRNA and a Sail adapter was ligated 
to the cDNA ends. The cDNAs were digested with Notl and Sail restriction enzymes and cloned 
into Xgt22A. The DNA was packaged in vitro using the k Packaging System (Life Technologies, 
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Inc.) and the recombinant phages were stored in 50 mM Tris/Cl pH 7.5, 100 mM NaCl, 1 mM 
MgS0 4 , 0.01% gelatin and CHC1 3 at 4°C. The cDNA library contained 9 x 10 5 independent 
clones. The E. coli strain Y1090r (Life Technologies, Inc.) was infected with the stock phage 
solution, and the library was amplified to a titer of 2 x 10 10 plaque-forming units/ml before the 
5 initial screen. 

Cloning the 30kDa cDNA. Standard methods were used to purify bacteriophage k 
particle from the amplified cDNA library and to extract the recombinant DNAs (Sambrook, et 
al 9 1989). Degenerative oligonucleotides were designed based on the amino acid sequences 
JO from peptide 23 and peptide 25 and used for primer-directed amplification of the DNA isolated 
yB from the library. Since the position of the peptides relative to each other within the protein was 
Us, not known, both the coding and reverse complement sequences were synthesized. The coding 
ijf and reverse complement sequences for peptide 23 used were 5'-GCN GAR CAY GGN CCN 
i ACN TGY ATG G-3' (SEQ ID NO:9) and 5'-C CAT RCA NGT NGG NCC RTG YTC NGC-3' 
% (SEQ ID NO: 1 0), respectively, and for peptide 25 were 5'- AAY CAR CAR GGN TGG AA-3' 
3 (SEQ ID NO: 1 1) and 5'-TTC CAN CCY TCY TGR TT-3' (SEQ ID NO: 12), respectively. In 
H these designations, N is inosine, R is A or G, Y is T or C. The corresponding sequences are 
5 underlined in FIG. 2. The oligonucleotides were synthesized and purified by HPLC by Midland 
u Certified Reagent Co. (Midland, TX). Conditions for amplification of the DNA by the 
20 polymerase chain reaction were 20 mM Tris/Cl pH 8.4, 50 mM KC1, 2.5 mM MgCl 2 , 0. 1 mg/ml 
bovine serum albumin, 10 mM concentration of each dATP, dCTP, dTTP, dGTP, 50 pmol of 
each primer, 2.5 units Taq DNA polymerase, and 1 /zg cDNA (Saiki, et al, 1988). Thirty cycles 
of amplification were performed at 92°C for 1 min, 45°C for 30 s, and 72°C for 30 s. A specific 
PCR product of 400 base pairs was amplified from the isolated cDNAs and subcloned into the 
25 Smal site of Bluescript KS' (Stratagene, La Jolla, CA). An [a 32 P]-CTP-labeled riboprobe was 
synthesized and used to screen the cDNA library. Solution hybridization and stringent washing 
was performed using standard procedures (Sambrook, et ai, 1989). One positive clone was 
plaque-purified from an initial screen of 1 x 10 6 clones. The cDNA was directionally subcloned 
into the prokaryotic expression vector, pSPORT 1 (Life Technologies, Inc.), using the Sail and 
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Notl cloning sites. A series of nested deletions were constructed (Erase-A-Base kit, Promega, 
Madison, WI) to generate overlapping clones from both the 5 ! and 3' ends of the cDNA. Both 
strands of the cDNA were sequenced by the dideoxynucleotide sequencing method of Sanger 
using the Sequenase Kit Version 2 (United States Biochemical Corp., Cleveland, Ohio) (Sanger, 
et al, 1977). Electrophoresis was performed in an 8% polyacrylamide gei (Hydrolink, AT 
Biochem, Malvern, PA) with 8 M urea and 25% formamide (v/v). The regions sequenced are 
indicated in FIG. 1. 

In vitro Transcription/Translation of the Cloned cDNA. Notl linearized pSPORT 
1/cDNA template (2.5^g) was transcribed in a 100 reaction containing 40 mM Tris/Cl (pH 
7.5), 6 mM MgCl 2 , 2 mM spermidine, 10 mM NaCl, 10 mM dithiothreitol, 100 units RNAsin, 
0.5 mM of each UTP, CTP, GTP, ATP, and 40 units T7 RNA polymerase for 2 h at 37°C. In 
vitro translation reactions were performed in parallel and included either 15 /xg of isolated 
mitochondria alone or mitochondria plus 4 /zg of the transcribed RNA. A rabbit reticulocyte 
lysate kit was used following the instructions of the manufacturer (Du Pont-NEN, Boston, MA). 
The proteins were synthesized in the presence of [ 35 S]-methionine for 1 h at 37°C and the 
reactions were frozen at -20°C. An equal volume of each reaction was analyzed by two- 
dimensional SDS-PAGE as described above. The gels were prepared for fluorography using 
Resolution (E.M. Corp., Chestnut Hill, MA), dried under moderate heat and vacuum, and 
exposed to x-ray film at -80°C. 

Expression of the 30-kDa Protein in MA-I0 Cells and COS I Cells. The full length 
SaR-Notl 1456 base pair 30-kDa cDNA was subcloned into the eukaryotic expression vector, 
pCMV (Andersson, et al, 1989). MA- 10 cells were transfected with DNA by a liposome- 
mediated uptake using the LipofectAMINE reagent (Life Technologies, Inc., Gaithersburg, MD) 
(Hawley-Nelson, et al, 1993). Plasmid DNA used in transfection experiments was purified by 
CsCl density gradient followed by polyethylene glycol precipitation. The DNA was mixed with 
1/10 the final volume of Waymouth's media minus serum and minus antibiotics (WAY") and 
added to an equal volume of WAY" media containing the LipofectAMINE reagent The 
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DNA/lipid solution was gently mixed and incubated for 30 min at room temperature, then WAY" 
media was added to achieve the final concentration of the DNA and LipofectAMINE reagent of 5 
^g/ml and 20 ^g/ml respectively. The cells were washed once with WAY"* media, incubated 
with transfection mix for 6 h, washed once with PBS, then incubated with Waymouth f s plus 15% 
horse serum. The same procedure was used for transfection of COS 1 cells except Dulbecco's 
modified Eagle's media minus serum and antibiotics was used for the transfection media. For 
isolation of mitochondria for immunoblot analysis, cells were grown on 100-mm dishes. For 
progesterone production assays, MA- 10 cells were plated into 96 well plates at 75,000 cells/well 
the day before the experiment. Typically, 8 wells each were transfected with either pCMV or 
pCMV+30-kDa cDNA for one experimental set, 

A reporter construct expressing a tartrate-resistant acid phosphatase (TRAP) was used 
to determine the efficiency of transfection of MA- 10 cells by the LipofectAMINE reagent The 
tartrate-resistant acid phosphatase expression plasmid contains the full-length human cDNA 
cloned into the pcDNAl vector provided by Dr. G.D. Roodman (Univ. of Texas HSC, San 
Antonio, Tx) (Reddy, et al, 1993). Forty-eight hours post-transfection the cells were fixed 
directly in the wells with 2% glutaraldehyde, then stained for tartrate-resistant acid phosphatase 
activity using an acid phosphatase staining kit (Sigma Chemical Co., St Louis, MO). Several 
wells of MA- 10 cells were transfected with the tartrate-resistant acid phosphatase expression 
plasmid for each experiment and the positively stain cells were counted by visual inspection 
using an inverted light microscope. Typically, 5-7% of the cells were stained positive for 
tartrate-resistant acid phosphatase expression. 

Immunoblot Analysis. Mitochondria were isolated from either MA- 10 or COS 1 ceils 
transfected with either pCMV or pCMV + 30kDa cDNA 48 hours post-transfection. For 
experiments in which progesterone production was measured (as described above), cells were 
solubilized directly in the well with 0.1% SDS, the ceil homogenates were collected and 
combined from all 8 wells, and the protein was precipitated using trichloroacetic acid. The 
protein was solubilized in sample buffer (25 mM Tris/Cl, pH 6.8, 1% SDS, 5% p- 
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mercaptoethanol, 1 mM EDTA, 4% glycerol, and 0.01% bromophenol blue) and loaded onto a 
12.5% SDS-PAGE minigel (Mini-Protean II System, Bio-Rad, Richmond, CA). Electrophoresis 
was performed at 200 V for 45 min using standard SDS-PAGE buffer as described above, and 
the proteins were electrophoretically transferred to a polyvinylidene difluoride membrane (Bio- 
Rad) at 1 00 V for 2 h at 4° C using the transfer buffer described above. For immunodetection of 
the 30kDa protein, antipeptide antibodies were generated in rabbits against amino acids 88-98 of 
the 30kDa proteins (FIG. 2). The peptide was synthesized and the antibodies were produced in 
rabbits on a fee for service basis by Research Genetics (Huntsville, Alabama). The immunoblot 
procedure was as follows; the membrane was incubated in blocking buffer (PBS buffer 
containing 4% Carnation non-fat dry milk and 0.5% Tween-20) at room temperature for 1 h, then 
incubated in fresh blocking buffer containing the primary label (rabbit sera containing the 
specific antipeptide antibodies) for an additional hour at room temperature. Next, the membrane 
was washed with PBS containing 0.5% Tween-20, 3 times for 10 min each wash, then incubated 
for 1 h at room temperature with fresh blocking buffer containing the secondary antibody, 
donkey anti-rabbit IgG conjugated with horseradish peroxidase (Amersham Life Sciences, 
Arlington Heights, IL). The membrane was washed as before, and the specific signal was 
detected by chemiluminescence using the Renaissance kit from Du Pont-NEN. 

Polyclonal antiserum has been generated to presequences of StAR, in particular, to a 
signal sequence from amino acid 10 to 26 of SEQ ID NO:2, and to a targeting sequence from 
amino acid 36 to 47 of SEQ ID NO:2. Antibodies of these polyclonal antisera were tested and 
immunoreact with precursor forms of StAR. Further polyclonal antiserum was generated to 
signal sequence from amino acid 1 to 26 of SEQ ID NO:2, this antiserum reacts with the signal 
peptide. 

Radioimmunoassay. 48 hours post-transfection, the growth medium was replaced 
with Waymouth's minus horse serum. After 6 h at 37° C, 5% C0 2 , progesterone was measured 
directly in the media by radioimmunoassay as previously described (Resko, et aL, 1974). The 
progesterone antibody was obtained from Holly Hill Biologicals, (Hillsboro, Oregon). 



-45- 



The following examples are included to demonstrate preferred embodiments of the 
invention. It should be appreciated by those of skill in the art that the techniques disclosed in the 
examples which follow represent techniques discovered by the inventor to function well in the 
5 practice of the invention, and thus can be considered to constitute preferred modes for its 

practice. However, those of skill in the art should, in light of the present disclosure, appreciate 
that many changes can be made in the specific embodiments which are disclosed and still obtain 
a like or similar result without departing from the spirit and scope of the invention. 

10 EXAMPLE 1 

0 Purification of 30-kDa Proteins 1 and 2 

!7: The present example provides for the purification of the 30 kDa proteins 1 and 2. The 

W family of LH-inducible proteins previously identified and characterized by the present inventors 
Jt;5 in mitochondria isolated from MA- 10 cells represented approximately 0.2% of the total cell 
p protein and 0.7% of the mitochondrial protein (Table 3). Due to the limited amount of the in 
/f vivo induced protein, the present inventors purified the 30kDa proteins from the mitochondria by 
O enriching a postmitochondrial fraction for the specific 30kDa proteins and separating the 
□ enriched fraction by two-dimensional SDS-PAGE. The proteins were electrophoretically 
20 transferred to nitrocellulose, and the specific 30kDa proteins were isolated by excising the 

nitrocellulose spot containing the bound protein. The purification scheme developed to isolate 
the 30kDa proteins is summarized under "Materials and Methods" hereinabove. The objective 
was to isolate a sufficient amount of the 30kDa protein for in situ tryptic digestion, peptide 
separation, and microsequence analysis. 

25 

Proteolytic digestion of each of the 30kDa forms produced identical peptides, 
indicating that the difference in these four forms is due to post-translational modifications of a 
single protein. The present inventors had previously shown that the difference in the isoelectric 
points for forms 3 and 4 is due to phosphorylation of forms 1 and 2. The 30kDa mitochondrial 
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proteins are processed from a larger precursor protein synthesized in response to hormone 
stimulation of MA- 10 cells. However, the precursor protein has a short half-life and is difficult 
to detect in MA- 10 cells (Stocco and Sodeman, 1991; Epstein and Orme- Johnson, 1991). For 
this reason the present inventors focused on the purification of the 30kDa proteins which are 
stable in mitochondria isolated from MA- 10 ceils. These proteins had been detected by 
metabolically labeling MA- 10 cells in vivo with [ 35 S]-methionine/cysteine and isolating the 
mitochondria for two dimensional SDS-PAGE analysis followed by fluorography. 

the present inventors first determined if the quantity of the 30kDa proteins in isolated 
mitochondria was sufficient to detect by protein staining. The 30kDa proteins were readily 
detectable by silver-stain after two dimensional SDS PAGE of mitochondria isolated from 
Bt 2 cAMP-stimulated MA- 10 cells. As expected, the proteins were absent in unstimulated cells. 
The major difficulty in the purification of the 30kDa proteins from MA- 10 cells was the lack of a 
bioassay, therefore, each step of the purification was monitored by silver-stained two 
dimensional SDS PAGE analysis of the proteins. 
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Table 3 summarizes the enrichment and recovery of the 30kDa proteins. 



Table 3* 

Purification of the 30-kDa Proteins from 
Bt 2 cAMP-stimulated MA-10 Cells 1 

Fraction Total protein Enrichment of the 

30-kDa proteins 



% recovery ^g 30-kDa -fold % recovery 

proteins/ enrichment 30-kDa 

mg sample proteins 
protein 

Mitochondria 100 7 1 100 

Mitoplasts 56 ±10 ND" ND ND 

CHAPS-soluble 22 ±6 18 2.6 55 

30-kDa fraction 0.6 ±0.3 94 ±28 13 7 



n=16 

'Shown are the results from the purification of the 30-kDa proteins from 40-50 mg of mitochondria. Each fraction was 
separated by two dimensionai-gei electrophoresis and the proteins were visualized by silver stain. 1 gel was analyzed for 
the mitochondrial fraction, 2 gels with different protein concentrations for the CHAPS-soluble fraction, and 4 gels with 
varying protein concentrations for the 30-kDa fraction were run simultaneously and stained for protein. The silver-stained 
image was captured, and the integrated intensity (II) of each spot (protein) was estimated using a Biolmage Visage 2000 
(Millipore). The percent of the total integrated intensity of each spot/protein was automatically determined. To estimate 
the amount of 30-kDa protein, the percent of total II for the 30kDa protein spots were multiplied by the amount of total 
protein loaded onto the first dimension gel. The results are shown as micrograms 30-kDa protein (sum of 1-4) per mg of 
fraction sample. 

a ND, mitoplast fraction not determined. 

The purification achieved a 13-fold enrichment with a 7% recovery of the 30kDa 
proteins. The goal was to sufficiently enrich the 30kDa proteins in a final fraction in order to 
resolve a sufficient quantity (1-2 ^g) of the specific 30kDa proteins by 2D SDS-PAGE. The ID 
preparative gel enriched the 30kDa proteins to approximately 100 Mg/mg of the final 30kDa 
fraction which allowed the present inventors to isolate the 30kDa protein. The 30kDa fraction 
was treated with alkaline phosphatase just prior to 2D SDS-PAGE to concentrate the 30kDa 
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proteins into forms 1 and 2 (Stocco and Clark, 1993). Comparison of the protein profiles for the 
30kDa fraction purified from mitochondria isolated from control and Bt 2 cAMP-stimulated MA- 
1 0 cells was used to verify that the correct protein spots, 30kDa 1 and 2, were isolated. Using 
this purification, approximately 75 mg of isolated mitochondria was required to isolate 
approximately 200 pmoi of the 30kDa proteins from Bt 2 cAMP-stimulated MA- 10 cells. 
Quantitatively, 60% of the total 30kDa proteins was recovered in 30kDa 2, and 40% was 
recovered in 30kDa 1. The difference between form 1 and form 2 may be that of methylation, 
acetylation, sulfation, prenylation, or myristylation, and the like. 

EXAMPLE 2 

Cloning the cDNA and Analysis of the Encoded 30kDa Protein 

The present example provides for the cloning and sequence analysis of the cDNA 
encoding the 30 kDa protein, and analysis of the protein sequence. Even though 30kDa 1 and 
30kDa 2 were thought to be identical proteins, they were isolated and stored separately. To 
ensure one homogeneous protein was used for microsequence analysis, only the 30kDa protein 2 
was sent to the Harvard Microchemical Facility where in situ digestion, tryptic peptide 
separation, and microsequence analysis was performed on a fee-for-service basis. Three tryptic 
peptides, #23, #25, and #45, were selected for microsequence analysis. The amino acid 
sequences for the peptides were determined to be: 

Peptide 23 : AEHGPTCMVLHPLA, (SEQ ID NO:3) 

Peptide 25 : ALGILNNQEGWK, (SEQ ID NO:4) 

Peptide 45 : GSTCVLAGMATHFGEMPEQ, (SEQ ID NO:5) 

The GenEMBL and SWISS-PROT data bases were searched for similarities to the 
three peptide sequences and no significant homologies were found (Fasta and TFasta programs, 
GCG Package, University of Wisconsin, Madison WL). 



-49- 



A cDNA library was constructed using polyA + RNA purified from total RNA that 
was isolated from Bt 2 cAMP-stimulated MA- 10 cells as described hereinabove. Using the amino 
acid sequences of the 3 peptides, degenerative oligonucleotides 17-24 bases in length were 
synthesized and used to amplify the 30kDa cDNA from the cDNA library by the polymerase 
5 chain reaction (PCR)(Saiki et aL, 1988). A 400bp specific PCR product was amplified using a 
combination of the peptide 25 coding and peptide 23 reverse complement oligonucleotides. The 
PCR generated DNA was used to probe the cDNA library and a 1456 bp full-length clone was 
isolated. Both strands of the cDNA were sequenced and a partial restriction map is shown in 
FIG. 1 . Also included are the positions of the PCR amplified sequence, the initiating ATG 
10 codon, and the termination TAA codon. FIG. 2 shows the nucleotide sequence of the 30kDa 
|1 cDNA which contains an open reading frame of 852 base pairs that encodes a protein of 284 
J\ amino acids with a calculated molecular weight of 3 1 .6 kDa. The deduced amino acid sequence 
W for the 30kDa protein is shown by three letter code under the nucleic acid sequence. The three 
%n peptide sequences derived from the protein microsequence analysis are encoded in the cDNA 
\3 (the amino acids are underlined in FIG. 2) which confirmed the translation reading frame. 

M Although the predicted molecular weight based on the deduced amino acid sequence 

S is lower than the observed size of the mitochondrial precursor protein by 2D SDS-PAGE, 
M inspection of the amino terminal amino acid sequence for the deduced protein revealed 
20 characteristics consistent with mitochondrial targeting sequences (von Heijne, 1986; von Heijne 
et aL, 1989). Namely, the first 25 amino acids lack acidic amino acids, are enriched in Arg 
(12%), Ser (8%), Ala (8%), and Leu (12%), and the predicted secondary structure is an 
amphipathic alpha helix. In addition, the amino acids at positions 38, 40, and 43 are Arg, Leu, 
and Ser, respectively, which would fit the amino acid consensus cleavage site, R-X-$-X-X-S, 
25 * where X represents any amino acid and $ represents a hydrophobic residue (Hendrick, et aL, 

1989). This amino acid motif is highly conserved in mitochondrial presequences that undergo a 
2 step sequential cleavage of mitochondrial precursors by the matrix processing protease (MPP) 
and the mitochondrial intermediate processing peptide (MIP) (Kalousek, et aL, 1988; Kiebler et 
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al } 1993). No significant similarities were found to the cDNA sequence when the GenEMBL 
and S WISS-PROT data bases were searched. 

The signal sequence is represented by amino acids at positions 1 to about 26 of SEQ 
ID NO:2, more particularly, from about amino acids at positions 10 to 26 of SEQ ID NO:2, The 
targeting sequence is represented by amino acids at about positions 36 to 47 of SEQ ID NO:2. 
The mature 30 kDa protein has methionine at position 48 as the N-terminal amino acid. 

To confirm that the cDNA clone encodes the precursor and mature mitochondrial 
proteins, the cDNA was transcribed in vitro and the synthesized RNA was used in an in vitro 
translation reaction. A two dimensional SDS-PAGE of the [ 35 S]-methionine labeled in vitro 
translated proteins in the presence of mitochondria demonstrated that the mobility of the proteins 
was identical to the LH-induced newly synthesized proteins in MA- 10 cells which were 
previously identified as the 37kDa precursor protein and the 30kDa mitochondrial proteins. 
Therefore, the cDNA obtained based on the amino acid sequence data for the 30kDa 2 protein 
encodes all forms of the previously described family of mitochondrial proteins. 

EXAMPLE 3 

Improved Production of Progesterone and StAR protein 
by Recombinant Means; Induction of Steroidogenesis 

The present example demonstrates that the expression of the 37 kDa protein has an 
effect on steroid production in mammalian cells. 

MA-10 cells were transfected with the 30kDa cDNA and progesterone production 
was measured as follows. The full-length cDNA was subcloned into the pCMV eukaryotic 
expression vector and transfected into MA-10 cells using LipofectAMINE (Life Technologies, 
Inc.). Cells (75,000 per well) were plated in a 96 well plate the day before transfection* The 
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10 



cells were incubated with the DNA/lipid transfection mixture for 6 hours, washed one time with 
PBS, and then incubated in Waymouths + 15% horse serum. Forty-eight hours post-transfection 
the cells were washed with PBS and Waymouths media (minus serum) was placed back onto the 
cells. After 6 h, the medium was removed and progesterone was measured by 
radioimmunoassay. The cells were lysed directly in the wells with 0. 1% SDS and protein was 
determined by the method of Bradford (1976). Progesterone production is shown in Table 4 as 
picograms progesterone produced per mg protein per 6 h. The transfected cells were not treated 
with hormone and progesterone was measured directly in the media after a 6 h incubation. A 
significant increase in progesterone production was observed in MA- 10 cells transfected with the 
30kDa cDNA compared to cells transfected with the pCMV vector alone (Table 4). Expression 
of the 30 kDa protein resulted in a 1.5 to 3.7 fold increase in steroidogenesis with an average rate 
of 166 pg progesterone produced per mg protein per hour. 



Table 4 

Progesterone production in MA-10 cells transiently transfected with the 30kDa cDNA 1 



30 



35 





Study 2 


n 


Control 
(nontrans- 
fected) 


pCMV 


pCMV + cDNA 


pCMV + 
cDNA/pCMV 










pg progesterone 




-fold increase 2 










/ mg protein/ 6 






20 








h 




3.4 




I 


3 


ND 3 


361 ±38 


1239 ± 347 


1.6 




II 


4 


ND 


317±77 


519±42 


3.7 




III 


8 


ND 


403 ± 145 


1775 ±444 


1.5 




IV 


8 


756 ±135 


787 ±174 


1148 ±174 


3.2 


25 


V 


8 


469 ±58 


428 ± 81 


1378 ± 233 


2.9 




VI 


8 


779 ±171 


1071 ± 143 


3146 ±768 





'Control (non-transfected) cells were grown in WAY+ for 48 h and washed once with PBS+ before WAY- media was 
added; pCMV, MA-10 cells were transfected with the pCMV vector alone; pCMV + cDNA, MA-10 cells were transfected 
with the pCMV + 30kDa cDNA. n represents the number of wells transfected for each experiment The progesterone was 
measured in each well and the mean + the standard deviation is shown for each study. The Student's t test was used to 
determine the statistical difference between the pCMV and pCMV + 30kDacDNA samples. 

2 In every study, the difference was significant with a p value <0.01. 

3 ND, not determined. 
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The level of expression of the 30kDa protein in the transfected MA- 10 cells was 
determined by immunoblot analysis. Antibodies having binding specificity for amino acids 88- 
98 of the 30 kDa proteins (see Materials and Methods) recognized a protein at approximately 
30kDa only in mitochondria isolated from Bt 2 cAMP-stimulated MA- 10 cells while no 
5 immunodetectable protein was observed in the non-stimulated ceils. In addition, the antibody 
recognized ail four 30-kDa protein spots specifically when mitochondrial proteins from 
stimulated MA- 10 cells were resolved by two-dimensional SDS-PAGE. Cell homogenates of the 
MA- 10 cells that had been transfected with pCMV + 30kDa cDNA and used for progesterone 
production for expression of the 30kDa protein were tested, however, no immunodetectable 

W protein was observed. Since the pCMV vector does not replicate in MA- 10 cells, and only 5% of 
the cells are transfected with plasmid DNA based upon expression of the tartrate-resistant acid 

!M reporter protein, it was not surprising that the protein could not be detected in cell homogenates . 

m collected from approximately 1 x 10 6 cells (8 wells from a 96 well plate). 

$ 5 In order to verify that the cDNA was being expressed in eukaryotic cells, COS 1 cells 

%j were used to transfect the pCMV + 30kDa cDNA since the pCMV plasmid can be replicated in 
jT these cells and approximately 80% of the cells are transfected with plasmid DNA based on the 
D expression of reporter protein. Forty-eight hours post-transfection, mitochondria were isolated 

and the 30 kDa protein expression was determined by immunoblot analysis. An immunospecific 
20 protein of approximately 30kDa was readily detectable in isolated mitochondria only from COS 
1 cells that had been transfected with the 30kDa cDNA, indicating the cDNA does express the 
same protein as the B^cAMP-stimulated MA- 10 cells. In addition, a 36.5kDa protein was 
detected in the COS 1 cells which would be consistent with the precursor protein. Importantly, 
the 30kDa protein was also detectable in MA- 10 cells transfected with pCMV + 30kDa cDNA by 
25 immunoblot analysis when isolated mitochondria were analyzed. The level of expression of the 
30kDa protein in the transfected MA-10 cells was approximately 7% of that observed in the 
Bt 2 cAMP-stimulated MA-10 cells. Thus, these data indicate that the expression of the 30kDa 
protein is sufficient to induce steroid production in MA-10 cells in the absence of hormone 
stimulation. 
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The present inventors have named this protein the Steroidogenic Acute Regulatory 
protein, StAR. While not wishing to be bound by any particular theory regarding a mechanism 
of action, the following working model for the acute regulation of steroidogenesis in Leydig cells 
by StAR is provided. The precursor protein is rapidly synthesized in the cytosol in response to 
hormone stimulation. The precursor binds to a receptor on the mitochondrial membrane and 
processing begins. Processing consists of the N-terminus entering the mitochondrial matrix and 
being cleaved to the 30 kDa form. It is during the processing that contact sites between the inner 
and outer mitochondrial membranes form and this very hydrophobic environment provides the 
medium through which cholesterol may pass to the inner membrane. Thus, it may be the 
processing of StAR from a 37 kDa form to a 30 kDa form that is functionally active in the 
transport of cholesterol and results in increased steroid production. This protein is also 
demonstrated to bind cholesterol (See Example 18). 

EXAMPLE 4 
Expression of StAR Precursor Protein in E. coli 

The present example describes studies carried out to express the 37 kDa precursor 
StAR protein in E. coli for overproduction thereof. 

The £. coli expression vector, pKK233-2, (Clontech, Palo Alto, CA) contains an 
IPTG-inducible promoter (PtJ, a LacZ ribosome binding site, and a unique Nco\ cloning site 
that provides an ATG initiation codon. Since expression of mitochondrial proteins tends to be 
toxic to bacterial host cells, basal levels of expression can be greatly reduced in £. coli strains 
that overproduce the lac repressor (lacP) and optimal expression can be achieved by induction 
for a short period of time, 

StAR cDNA was cloned in pSPORT vector (GIBCO Life Technologies, Gaithersburg, 
MD) and this vector was used for PCR amplification of the coding sequence for StAR, Primers 
were designed to introduce restriction sites (Ncol at the 5' end and Hindlll at the 3' end) for 
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directional subcloning into the pKK233-2 vector. Ligation of the amplified StAR cDNA 
fragments with the pKK233-2 vector constructed a recombinant pKK233-2/StAR piasmid. The 
E. coli strains, JM109 and DHSaF'IQ, were transformed with this piasmid and maintained in LB 
media (10 g tryptone, 5 g yeast extract, 10 g NaCl, and 100 ^g/ml ampicillin). For expression, a 
fresh culture (15-30 ml LB media) inoculated with the transformed E. coli was grown to an 
ODooq = 0.6 at 37 °C, then IPTG (isopropyl p-D-thiogalactopyranoside) was added to a final 
concentration of 1 mM. Protein expression was induced at 30 °C for 2 hours and cells were 
harvested. Cells were lysed by sonication in buffer containing 50 mM Tris (pH 8.0), 150 mM 
NaCl, 0.02% sodium azide, 0.1% SDS, 1% NP-40, 0.5% sodium deoxycholate, 100 jig/ml 
phenylmethylsulfonyl fluoride, 2 jig/ml aprotinin, and 2 jag/ml leupeptin. 

From staining patterns on gels, it is apparent that significant amounts of StAR protein 
were made. The extract of E. coli DHSaF'IQ with the StAR cDNA insert contained a 37 kDa 
polypeptide which was approximately 3% of the total cellular protein. This 37 kDa protein was 
not detected in E coli without the piasmid. The protein reacted with the antibody raised against 
amino acids 88-98 of StAR. 

For isolation and purification of the StAR precursor protein, the sonic lysate was 
ultracentrifuged (100,000 x.g, lh), and the supernatant passed through an affinity column packed 
with Protein A agarose beads crosslinked with the anti-StAR antibody having binding specificity 
for amino acids 88-98 of StAR. The fractions enriched for StAR precursor protein were 
concentrated and further purified by passage through a gel filtration column (packed with 
Sephadex G-75 beads). Flow-through fractions were tested for the StAR precursor protein by 
Western blot analysis and the purity was assayed using a silver staining method. 
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EXAMPLE 5 

StAR is Hormonally Regulated and Developmentally Regulated 



The present example provides studies that show that the production of StAR protein is 
hormonally regulated, as well as developmentally regulated in vivo. 

MA- 10 cells were stimulated with Bt 2 cAMP and StAR mRNA levels were 
determined by Northern blot analysis. Within 1 hour of Bt 2 cAMP stimulation, two major 
transcripts of approximately 3400nt and 1600nt, and one minor transcript of 2700nt, were 
detected. StAR mRNAs were markedly induced (20x) after 2 hours of hormone stimulation with 
maximal levels obtained after 6 hours. Subsequent to the marked mRNA induction, the greatest 
induction (lOx) in StAR protein was detected after 4 hours of Bt 2 cAMP stimulation by 
immunoblot analysis. Comparatively lower levels of StAR could be detected after 1 hour of 
hormone stimulation with maximal levels accumulated within 8 hours. 

Hormone-induced progesterone production rose above basal levels in MA- 10 cells 
typically within 1 hour with the greatest increase in rate of steroid output measured between 2-4 
hours of hormone stimulation, consistent with the induction in StAR protein. 

These data indicate that StAR is transcriptionally regulated by a cAMP-mediated 
mechanism. Immunoblot analysis of several mouse tissues indicates that StAR protein is 
expressed in the adrenal, ovary, and testis, and is not expressed in brain, muscle, liver, kidney, 
spleen, heart, uterus, or placenta. The developmental expression of StAR was assessed by in situ 
hybridization analysis of mouse embryonic tissue. Earliest detection of StAR transcripts was at 
embryonic day 10.5 (E10.5) in the genital ridge. By E12.5-E14.5, StAR was readily detected in 
the interstitial cells of the testis and adrenal cortex and continued to be expressed in the adult. 
StAR expression was absent in the ovary at E12.5-E14.5, but was abundant in the adult ovary. 
The developmental pattern of expression for StAR parallels that observed previously for 



-56- 



cytochrome P450scc which provides further supporting evidence for the importance of StAR in 
steroid hormone biosynthesis. 

The present inventors have generated stable transfected MA- 10 cell lines that 
constitutively synthesize the 30kDa protein and that produce steroid constitutively at a level 
several fold higher (about 9x) than basal parental MA- 10 cells. 

EXAMPLE 6 
Screening for Mutations in the StAR Gene 
for Identifying Steroid Hormone-Dependent Pathologies 

The present example provides methods by which the nucleic acid molecules, in 
particular, fragments of the nucleotide sequence of SEQ ID NO: 1 or SEQ ID NO: 1 5 may be used 
to detect mutations within the StAR gene. These screening techniques may be used to identify a 
number of different pathologies; particularly steroid hormone-dependent pathologies, such as, for 
example, lipoid congenital adrenal hyperplasia (LCAH), adrenal hypoplasia congenita, 
hypogonadotropic hypogonadism, precocious puberty, or McCune-Albright syndrome. 

The identification of highly conserved mutations and the development of an 
appropriate screen would provide regulations and standards for clinical testing and screening for 
these metabolic disorders. 

Lipoid Congenital Adrenal Hyperplasia (LCAH) 

StAR DNA may be used for screening adrenal tissue obtained from patients with 
Lipoid Congenital Adrenal Hyperplasia (LCAH) to test for a possible role of StAR in the disease 
state. This can be achieved with Southern blotting and hybridization with a cDNA probe. Fairly 
large DNA rearrangements of greater than 500 bp may be detected in this manner. However, it 
may be that the mutations within the StAR gene resulting in steroid hormone-dependent 
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pathologies are too small to detect by Southern blotting. This would be the case if they are due 
to point mutations or to small insertions, deletions or other rearrangements. 

Smaller StAR gene mutations are detected by DNA sequencing which can be 
performed on a genomic DNA template, a cDNA template prepared from RNA by reverse 
transcriptase, or on a PCR product. In an attempt to detect mutations rapidly, several methods 
are available; chemical cleavage, denaturing gradient electrophoresis (DGGE) and ribonuclease 
cleavage, and single strand conformation polymorphism. These methods and others may be used 
in conjunction with the present invention and may be performed after PCR amplification of the 
DNA region under study. 

Testicular tissue of two patients and genome DNA of a third patient with LCAH have 
mutations in the gene for StAR (Lin et al, Science, 267:1828-1831, 1995) (incorporated herein 
by reference). These mutations consist of C to T transitions in the gene sequences, which 
resulted in the premature insertion of stop codons.. This resulted in the truncation of StAR 
protein by 28 amino acids in two of the patients and 93 amino acids in another. These 
truncations were confirmed by Western analysis following expression of the mutated cDNAs in 
COS cells. Virtually none of the precursor form of StAR expressed from the cDNA of these 
patients was converted to the mature mitochondrial form. Expression of the StAR cDNA from 
these patients in COS 1 cells indicated that the protein produced was inactive in its ability to 
promote steroidogenesis, whereas the normal protein resulted in an 8-fold increase in steroid 
production when expressed 

EXAMPLE 7 
Gene Therapy 

This prophetic example describes some of the ways in which the present invention 
may be of use in the treatment of steroid hormone-dependent disorders, especially those 
characterized as involving defects in cholesterol transport. 
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A wild-type human StAR gene may be introduced into human tissue to provide a 
wild-type copy of the gene and therefore, also a wild-type protein product, that may correct the 
genetic lesion that causes the steroid hormone-dependent disorder 

Human adenovirus or retrovirus are means for introducing genes into tissue. 
Adenoviruses are double-stranded DNA tumor viruses with genome sizes of approximate 36 kb. 
As a model system for eukaryotic gene expression, adenoviruses have been widely studied and 
well characterized, and have already been used in a gene transfer system (see e.g., WO9506743, 
WO9502697, WO9500655, W09428938, W09419478, and W09412649, each publication is 
incorporated by reference herein) . This group of viruses is easy to grow and manipulate, and 
they exhibit a broad host range in vitro and in vivo. In lytically infected cells, adenoviruses are 
capable of shutting off host protein synthesis, directing cellular machineries to synthesize large 
quantities of viral proteins, and producing copious amounts of virus. 

In general, adenovirus gene transfer systems are based upon recombinant, engineered 
adenovirus which is rendered replication-incompetent by deletion of a portion of its genome, 
such as El , and yet still retains its competency for infection. Relatively large foreign proteins 
can be expressed when additional deletions are made in the adenovirus genome. For example, 
adenoviruses deleted in both El and E3 regions are capable of carrying up to 10 Kb of foreign 
DNA and can be grown to high titers. Persistent expression of transgenes follows adenoviral 
infection. 

Particular advantages of an adenovirus system for delivering foreign genes and their 
protein products to a cell include (i) the ability to substitute relatively large pieces of viral DNA 
with foreign DNA; (ii) the structural stability of recombinant adenoviruses; (iii) the safety of 
adenoviral administration to humans; and (iv) lack of any known association of adenoviral 
infection with cancer or malignancies; (v) the ability to obtain high titers of the recombinant 
virus; and (vi) the high infectivity of adenovirus. 
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Further advantages of adenovirus vectors over retroviruses include the higher levels 
of gene expression. Additionally, adenovirus replication is independent of host gene replication, 
unlike retroviral sequences. Because adenovirus transforming genes in the El region can be 
readily deleted and still provide efficient expression vectors, oncogenic risk from adenovirus 
vectors is thought to be negligible. 

Patients testing positive for LC AH and for whom the medical indication for 
adenovirus-mediated gene transfer has been established, would be tested for the presence of 
antibodies directed against adenovirus. If antibodies are present and the patient has a history of 
allergy to either pharmacological or naturally occurring substances, application of a test dose of 
on the order of 10 3 to 10 6 recombinant adenovirus under close clinical observation would be 
indicated. 

Recombinant adenovirus providing the wild-type StAR gene may be prepared and 
purified by any of a variety of methods, so as to provide a preparation suitable for administration 
to human subjects, including, but not limited to cesium chloride density gradient centrifugation, 
and subsequently tested for efficacy and purity. Virus is administered to patients by means of 
intravenous administration in any pharmacologically acceptable solution, either as a bolus or as 
an infusion over a period of time. Generally speaking, it is believed that the effective number of 
functional virus particles to be administered would range from 5 x 10 10 to 5 x 10 12 . 

Patients would remain hospitalized for at least 48 hr to monitor acute and delayed 
adverse reactions. Serum levels of a protein product may be monitored or Southern blots may be 
performed to follow the efficacy of the gene transfer. Adjustments to the treatment may include 
adenovirus constructs that use different promoters or a change in the number of pfu injected. 
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EXAMPLE 8 

Expression of StAR induces Steroid Production in the absence of Hormone Stimulation 



The present example demonstrates that StAR expression induces steroid production in 
the absence of hormone stimulation. 

Transient transfection of COS 1 cells with StAR resulted in a several fold increase in 
steroid production in the absence of hormone stimulation. Transient transfection of COS 1 cells 
with StAR was observed to provide in a several fold increase in steroid production. Also, 
increases in StAR mRNA and protein closely paralleled steroid production indicating a temporal 
relationship in these parameters. 

EXAMPLE 9 
Calcium Stimulates StAR Protein 

The effect of changes in [Ca 2+ ] c on intramitochondrial cholesterol and the distribution 
of StAR protein in mitochondria during activation by Ca 2+ is described in the present example. 

In adrenal glomerulosa cells, angiotensin II (Ang II) stimulates aldosterone synthesis 
through rises of cytosolic calcium, [Ca 2+ ] c . The rate-limiting step is the transfer of cholesterol to 
the inner mitochondrial membrane, where it is converted to pregnenolone by the P450 side chain 
cleavage en2yme (P450scc)» This transfer is believed to occur as the 37 kDa precursor of the 
Steroidogenic Acute Regulatory (StAR) protein is imported into the mitochondria. 

Freshly-prepared bovine zona glomerulosa cells were stimulated with Ang II (10 nM) 
or submitted to a cytosolic Ca 2+ clamp (600 nM) for 2 h. Mitochondria were isolated and 
subfractionated into outer membranes (OM), inner membranes (IM) and contact sites (CS). 
Cholesterol content was determined by the cholesterol oxidase assay. 
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When glomerulosa cells were exposed to Ang II, a marked increase of cholesterol in 
CS occurred (to 172±28% of controls, n=3). No significant changes were detected in OM 
cholesterol, suggesting a stimulation of cholesterol supply to the mitochondria in response to 
Ang II. Stimulation of intact cells with Ca 2+ led to a marked decrease in cholesterol content of 
OM (to 54±24% of controls, n=5). Cycloheximide specifically and significantly reduced Ca 2+ - 
activated cholesterol transfer to IM. Western blot analysis revealed a cycloheximide-sensitive 
increase of StAR protein (to 141±14% of controls, n=5) in mitochondrial extracts of Ca 2+ - 
mobilizing, agents, newly synthesized StAR accumulates in IM after transiting through CS. One 
of the main functions of the Ca 2+ messenger is to increase cholesterol supply to the P^SQ^qq 
enzyme by enhancing endogenous intermembrane cholesterol transfer. The import of StAR 
protein to IM is accompanied by cholesterol transfer, thus promoting the activation of the 
steroidogenic cascade. 

EXAMPLE 10 

Angiotensin II Stimulates Intramitochondrial Cholesterol Transfer 
and StAR Protein in Bovine Adrenal Glomerulosa Cells 

The effect of Ang II on intramitochondrial cholesterol and the distribution of StAR 
protein in submitochondrial fractions during activation by Ang II is described in the present 
example. 

Freshly-prepared bovine zona glomerulosa cells were stimulated with Ang II (10 nM) 
or submitted to a cytosolic Ca 2 * clamp (600 nM) for 2 h, as described in Example 9. 
Mitochondria were isolated and subfractionated into outer membranes (OM), inner membranes 
(IM) and contact sites (CS). Cholesterol content was determined by the cholesterol oxidase 
assay, also as described in Example 9. 

StAR protein in Ang II stimulated (to 157% of controls, n=2) glomerulosa cells. Ang 
II increased StAR in IM, and this effect was prevented by cycloheximide. 
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EXAMPLE 11 
StAR and a Water Soluble CaM Kinase II Inhibitor 



The present example demonstrates the effect of a water soluble CaM kinase II 
inhibitor on the agonist inductor of StAR protein. The water soluble CaM kinase II inhibitor, 
KN93, was used in the present example and is representative of the water soluble CaM kinase II 
inhibitors generally. 

The human adrenocortical carcinoma-derived cell line (H295R) was used. This cell 
line secretes multiple steroids, including aldosterone and Cortisol. This cell line is an appropriate 
model system to investigate the acute regulation of human aldosterone synthesis, as described in 
Bird et al (1993) (Endocrinology, 133, 1555-1561), which is specifically incorporated herein by 
reference. 

To further investigate the site of KN93 action, the effect of KN93 on agonists 
induction of the StAR protein, shown herein to regulate movement of cholesterol from the outer 
to the inner mitochondrial membranes, was examined. The amount of StAR protein was 
increased following treatment of H295R cells with angiotensin II (Ang II) potassium (K), and 
Bay K (a calcium channel activator) as shown by the present investigation (See Clark et ai, 
1995, Moi. Cell Endocrinol, 115:215-19). KN93 at concentration between 1 and 3 \xM 9 which 
blocked steroidogenesis by 60 to 80%, did not affect induction of StAR protein by Ang II, K + , or 
Bay K. These results support the finding that CaM kinase II is involved in the process of 
cholesterol mobilization to the mitochondria. 

Materials and Methods 

Materials 

[Val 3 ]-Angiotensin II acetate (Ang II), potassium chloride, 22R-hydroxlycholesterol 
(22ROHChol), d-aldosterone, hydrocortisone (Cortisol) and laboratory reagents were from Sigma 
Chemicals (St Louis, MO). Dibutyryl cAMP (dbcAMP) was from Aldrich Chemicals (Milwaukee, 
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WI). (=)-Bay K 8644 (Bay K) was from Research Biochemicals International (Natick, MA). The 
protein kinase inhibitor KN93 (2-|>H2-hydroxye%l)-N-(4-m^ 

cWorocinnamyl)-N-methylbenzyiamine) was from Seikagaku America, Inc. (Ijamsville, MD). The 
calmodulin inhibitor compound R 24571 (calrnidazolium) was from Janssen Pharmaceutica (Berse, 
Belgium). 

Cell culture 

H295R cells were initially obtained as NCI-H295 cells from the American Type Culture 
Collection (Rockville, MD), and then selected as described previously (Bud et al (1993)). 
Reflecting growth and culture differences between the original ATCC cells and the selected 
subpopulation, these cells are designated as H295R cells and are available from ATCC as such. 
Cells were maintained in a 1 : 1 mixture of Dulbecco's modified Eagle's and Ham's F 12 
(DME/F12) medium containing pyridoxine HCI, L-glutamine and 15 mM Hepes (Gibco BRL; 
Gaithersberg, MD); medium was supplemented with insulin (6.25 ^ig/rnl), transferrin (6.25 
|ng/ml), selenium (6.25 ng/ml), bovine serum albumin (1.25 mg/ml), and linoleic acid (5.35 
^ig/ml) added in the form of 1 % ITS plus (Collaborative Biomedical Products; Bedford, MA). 
In addition, cells were grown in the presence of either 2% low protein serum replacement (Sigma 
Chemicals; St. Louis, MO) or 2.5% Nu serum type I (Collaborative Biomedical Products; 
Bedford, MA), as well as antibiotics. Cells were maintained and grown in 75 cm 2 flasks at 37 °C 
under an atmosphere of 5% CCy95% air, subcultured onto 12 well plates and used for 
experiments 48 hours later as indicated below. 

Stimulation of steroid secretion and analysis of steroids 

Subcultured cells were maintained 24 hours in DME/F12 medium containing 0.2% calf 
serum, 0.01% BSA and antibiotics (low serum). Cells were preincubated with KN93 or 
calrnidazolium for 30 minutes at 37°C. Fresh low serum medium containing the agents as 
indicated was then added to the cells and the incubation carried out at 37°C for the indicated times. 
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The aldosterone and Cortisol contents of medium recovered from each well were determined with 
aldosterone and Cortisol standards prepared in low-serum medium by commercial aldosterone and 
Cortisol radioimmunoassays (Diagnostic System Laboratories; Webster, TX). Results of 
aldosterone and Cortisol assays were normalized to cellular protein per well, expressed as pmol per 
5 mg cell protein and transformed to percentage of the control response where indicated. IC^ values 

for calmidazolium and KN93 were calculated by taking the difference between basal and 
stimulated values as 100%. 

Protein determination 

10 Cells were solubilized in Tris-HCl (50 mM pH 7.4) containing NaCl (150 mM), SDS (1%), 
y EGTA (5 mM), MgCl 2 (0.5 mM), MnCl 2 (0.5 mM), and phenylmethylsulfbnylfluoride (PMSF 0.2 
P mM), and stored frozen at -20 °C. Protein content of samples was then determined by the 

bicinchoninic acid protein assay, using the BCA assay kit (Pierce, Rockford, IL). 

f5 

O Immunoblot analysis 

11 For each treatment, cells were solubilized as described above and an equivalent amount of 
Jf protein (30 \ig) for each sample were separated by SDS-PAGE (12.5%) and then electrophoretically 
0 transferred to polyvinyiidene difluoride membrane as previously described (Clark et ai (1994) J. 
20 Biol. Chem., 269:283 14-322, reference specifically incorporated herein by reference). Immunoblot 

analysis was as described (Id) using a rabbit sera containing the specific StAR antipeptide antibody 
(see ANTIBODIES infra) as the primary label and a donkey anti-rabbit IgG conjugated with 
horseradish peroxidase (Amersham, Arlington Heights, IL). The specific signal was detected by 
chemiluminescence assay using the Renaissance kit from DuPont NEN (Boston, MA) and the StAR 
25 specific bands were quantitated by Biolmage Visage 2000 computer-assisted image analysis 
(Biolmage, Ann Arbor, MI). Different exposure times were used to insure linearity. 

Statistical analysis 

Statistical analysis of the data was by analysis of variance, followed by Student-Newman- 
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Keuls multiple comparison analysis or, for transformed data, by the Mann- Whitney U test. 

KN93 effects on StAR induction 

The data above suggest that CaM kinase II plays a role in steroidogenesis prior to P450scc 
conversion of cholesterol to pregnenolone. StAR protein levels were elevated by Ang II, K\ and 
Bay K in H295R cells . To discriminate between the potential sites of CaM kinase II involvement 
in aldosterone production, the effects of KN93 on StAR protein expression were also examined. 
Agonist-induction of StAR protein expression was not affected by co-treatment with KN93 at a 
concentration (3 |iM) which potently inhibited aldosterone production. 



EXAMPLE 12 

Transcription Repressor DAX-1 Inhibits StAR mRNA and Protein Synthesis; 

Role of StAR in Screening for DSS 

The overexpression of DAX-1 in Y-l adrenal tumor cells results in a complete inhibition of 
steroid synthesis and an accompanying inhibition of StAR mRNA and protein synthesis. The 
present example demonstrates this phenomenon. 

Male to female sex reversal has been observed in individuals with duplications of the short 
arm of the X chromosome. The study of Xp duplicated patients demonstrated that sex reversal 
results from the presence of two active copies of the DSS (Dosage Sensitive Sex Reversal) locus. 
A double dosage of the DSS disrupts testis formation while its absence is compatible with a male 
phenotype, suggesting a role for DSS in ovarian development and as a link between ovary and 
testis formation, DSS has been mapped to a 160-bp region of human chromosome Xp21, which 
includes the adrenal hypoplasia congenita (AHC) locus. 
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It is contemplated by the present inventors that the DSS is DAX-1, an unusual member of 
the nuclear hormone receptor superfamily. DAX-1 maps to the DSS critical region, and is 
responsible for X-linked adrenal hypoplasia congenita (See Swain et al. 1996, Nature Genetics, 
12: 404-409). 

5 

The present inventors contemplate that when a double dose of DSS is present, it can bind 
to SF1 sites at a higher amount than usual, thus turning off SF1 gene (such as StAR, p450 and 3- 
bHSD). One would expect to see no testosterone synthesized, and the male sex organs would not 
develop. Hence, the present observation that DAX-1 inhibits StAR synthesis suggests a use of 

IP StAR nucleic acid in a screening assay for the disease DSS and constitutes a method of use as part 

yi of the present invention. 

J SCREENING FOR DOSAGE SENSITIVE SEX REVERSAL 

4p The present example describes the utility of using mouse StAR mRNA sequence, or the 

f| StAR mRNA sequence as defined in SEQ ID NO: 14, which is the coding region of the mouse 
/: mRNA sequence from nucleotide position 210 to 931, in a screening assay for DSS. This region 
O shares about 84% identity with the human StAR mRNA sequence from nucleotide position 267 
r to 988 (SEQ ID NO: 15), in a method for screening a patient sample for Dosage Sensitive Sex 

Reversal (DSS). The screening procedure may also be conducted using the nucleic acid sequence, 
20 SEQ ID NO: 15, or a fragment of that sequence. It is anticipated that any of the primers described 

herein may be used as a PCR primer to generate an about 400 bp piece of nucleic acid that may 

be used for the present screening assay. 

A tissue sample from a patient to be screened would be tested to determine the presence 
25 of nucleic acid that demonstrated hybridization to the probe sequence for StAR by RT-PCR using 
any one of the above described StAR encoding sequences. If presence of StAR nucleic acid is 
detected, the patient sample will be determined not to evidence DSS. If presence of StAR nucleic 
acid is detected, then the patient will be determined to have DSS, or to be at risk of developing 
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this disease. This screening protocol may be used together with or clinically detectable symptoms 
of the patient in determining the appropriate treatment for the patient. 

Children diagnosed with DSS have been found to have elevated, and most times twice, the 
level of a mutated form of the protein, DAX-1 (See Zanaria et al. (1994) Nature 372(15):635- 
641). DSS is a disease that affects primarily males, and that causes afflicted males to 
phenotypically revert to females. (See Namabe et aL (1992) Hum. Genet, 90:211-214). The 
present invention contemplates the use of StAR in the diagnosis of this disease, as it is 
contemplated that the StAR gene is not functional or is not expressed at sufficient levels in these 
patients, thus resulting in the phenotypically detectable sex reversal, 

EXAMPLE 13 
Mouse StAR Transcripts and Human StAR Transcripts 
Specific for StAR mRNA 

Three transcripts specific for StAR mRNA have been detected in the mouse (-1 .6 kb, -2.7 
kb, and -3.4 kb) and in the human (-1 .6 kb, -4.4 kb, and -7.5 kb). 

Differences in the lengths between the mouse and human transcripts were identified. The 
difference in length is attributed, at least in part, to a difference in the length of the 3-intranslated 
regions. 

The additional length was determined to exist at a site 31 of the Xhol cleavage site, as 
determined in the following study: 

Digest X DNA purified from bacteriophage with Not 1/Sal 1/Xho 1 to release the Not 1 and Sal 1 
insert (cloning strategy) and Xho 1 to determine which end of the cDNA has the additional 
sequences. 
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Results: StARcDNA 

Clone 1 (StAR cDNA) -1500 bp, mouse StAR sequence expect -61 Ibp and 939 bp fragments 
Clone 2 (FP#1)- 1500 bp 
Clone 3 (FP#12) - 1650 bp 
Clone 4 (FP#5) - 2400 bp 
Clone 5 (FP#7) - 2400 bp 
Clone6(FP#16)-2350bp 
Clone 7 (FP#13)- 2800 bp 
Conclusion: additional length in human StAR cDNA is 3' of the Xho 1 site. 

EXAMPLE 14 
Mouse StAR Identity to Other Species 

Full length cDNA clones for StAR have been isolated for the mouse and human (FIG. 3). 
cDNA for human StAR was isolated from a human adrenal library. The deduced amino acid 
sequence from the human StAR DNA was about 87% identical to the mouse sequence. The cDNA's 
have approximately 84% homology. The structural gene for StAR has also been isolated and 
characterized for both the mouse and human. The genes span 6.5 kb in the mouse and 8 kb in the 
human with the intronic sequences contributing to increased length in the human. Both are 
organized into seven exons and six introns with exons III-XI being of identical size. 

A StAR pseudogene was identified by RT-PCR amplification of RNA from human testis and 
PCR amplification of human genomic DNA (Sugawara et al (1995), Lin et al. (1995)). Sequence 
analysis of the pseudogene indicated that it lacks introns and has several nucleotide insertions, 
deletions, and substitutions (Sugawara et al. (1995)). However, detection by RT-PCR suggests that 
the pseudogene is transcribed. 

Southern blotting of somatic cell hybrids followed by fluorescent in situ hybridization was 
used to map the human structural gene to chromosome 8p. 11.2 and the pseudogene to chromosome 
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13 (Sugawara et al (1995)). The human promoter, a U-kb upstream sequence, can confer both 
basal and cAMP-dependent transcriptional activation of a luciferase reporter gene in Yl mouse 
adrenal cells (Id). Similarly, -1 kb of the S'-upstream sequence of the mouse StAR gene has been 
observed to drive expression of human GH reporter in Yl cells. The suggestion that StAR is a 
potential target for SF-1 is supported in studies that demonstrate that SF-1 binds to both the murine 
and human promoters and contributes to the transcriptional activity of StAR. 

StAR protein sequence is highly conserved with 85%-88% identity and greater than 90% 
similarity in the species studied to date (FIG. 5). In addition, a partial cDNA has been isolated from 
the sheep that has 80% identity with the corresponding regions of the other species. The greatest 
divergence appears to center around the putative mitochondrial signal sequence cleavage site 
described here for the mouse sequence. This region of the protein contains an amino acid motif that 
is highly conserved in presequences that undergo a sequential two-step cleavage by the matrix- 
processing protease and the mitochondrial intermediate-processing peptide, respectively. 

The submitochondrial localization of StAR has been determined using protein-A gold 
labeling of immuno-reacted StAR in mouse adrenal zona fasciculata cells. Colloidal gold particles 
were concentrated within the mitochondria to the intermembrane space and the intermembrane space 
side of the cristae membrane (King et al (1995)). 

In vitro transcription/translation systems have been used to demonstrate that isolated 
mitochondria are competent to import and process both mouse and bovine StAR protein. Rat heart 
mitochondria were used for the import assay with bovine StAR, indicating import of StAR is not 
dependent upon factors specific to mitochondria isolated from steroidogenic tissues. The ability of 
StAR to increase steroid production has also been confirmed in an in vitro reconstituted system. 
StAR protein added to mitochondria isolated from MA- 10 mouse Leydig tumor cells has been 
observed to result in a time- and dose-dependent increase in pregnenolone synthesis. This 
stimulation was shown to be specific for StAR in that pregnenolone synthesis was not affected by 
addition of another mitochondrial imported protein, adrenodoxin. 
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Computer analysis of the StAR protein sequence has identified three putative PKA/Cam 
kinase II phosphorylation sites and one PKC phosphorylation site (FIG. 5). Site-directed 
mutagenesis of StAR cDNA, which changes these putative target residues, will be used to determine 
if the phosphorylation of StAR itself is required for mitochondrial import, and that phosphorylation 
5 of StAR is directly linked to the steroidogenic response of the cell to hormone stimulation. 

The high degree of similarity between the nucleotide sequence for the mouse and human 
StAR does not extend into the S'-flanking regions of the genes. However, the human StAR promoter 
also contains a putative SF-l/AdBP4 binding site located -926/-918 relative to the start site of 
W transcription in addition to two putative Spl consensus consequences (Sugawara et al (1995)). 

;H The coding region of the human StAR cDNA has been characterized and is shown in SEQ 

CO ID NO: 19. The coding nucleotides are capitalized, whereas the untranslated nucleotides are in 

as, S 

j£ lowercase. 

Toward more definitive studies on the mechanisms regulating StAR gene expression, 
O approximately 1 kb and 1 .3 kb of the 5-flanking regions of the mouse and human gene, respectively, 
S have been isolated and sequenced Inspection of the murine StAR promoter does not provide many 

clues to putative exacting regulatory elements; it lacks a canonical TATA box and does not contain 
20 a consensus cAMP-responsive element Thus, like most of the cytochrome P450 steroid hydroxylase 

genes that are also regulated by cAMP but lack classical CREs, the regulatory regions of StAR may 

be unique. 

EXAMPLE 15 

25 StAR is Regulated by Oestradiol 

This present study demonstrates that oestradiol alters components of the steroidogenic 
pathway between cholesterol and P450 sec. 
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Oestradiol inhibits testosterone (T) secretion in rams, and at least part of this action is 

gonadotrophin-independent There is therefore a local effect on testicular steroidogenesis. Low-titre 

immunization of rams against oestradiol increases T secretion without affecting testicular 

steroidogenic enzyme activity or expression. Adult rams, during the breeding and nonbreeding 

5 seasons, were injected (iv every 3-4 days) with enough oestradiol antiserum to maintain an antibody 

titre of about 1:200. At this titre, the antibody does not cross-react with T. Blood samples were 

taken every 20 minutes for 10 h (d 21 of immunization) for pulsatile T and LH measurements. 

Pooled samples were used for assay of cholesterol, HDL and LDL+VLDL concentrations. The 

« 

testes were then removed and RNA extracted for measurement of LDL receptor and StAR mRNA 
10 levels. 

f % Mean, basal plasma T concentrations were not altered by treatment or by season (P>0.05). 

ftj Relative levels of StAR mRNA were 67% higher in breeding vs nonbreeding season (P<0.01), and 
ffi 21% higher in immune vs control rams (PO.05) irrespective of season. Low-titre oestradiol 
5 immunization increases T secretion by increasing StAR mRNA abundance and thus the delivery of 
£3 cholesterol to P450scc. These data provide evidence that StAR is regulated by oestradiol in the 
testis. 

O EXAMPLE 16 

20 Binding of StAR Protein to a Mitochondrial Membrane Protein Complex 

The following studies demonstrates that cholesterol transfers to the inner mitochondrial 
membrane only as StAR protein is being imported. 

25 A protein complex was purified from MA-10 cell mitochondrial membranes using an affinity 

column constructed by crosslinking the C-terminus of commercially synthesized StAR signal peptide 
to Sepharose 4B beads modified with hydrazine. Specific binding of StAR signal peptide to the 
complex was examined. When biotinylated StAR signal peptide was incubated with the protein 
complex, followed by crosslinking with disuccinimidyi suberate, it shifted in size from 3.5 kDa to 

-72- 



approximately 300 kDa upon one dimensional SDS polyacrylamide gel electrophoresis. The binding 
of biotinylated StAR signal peptide to the protein complex was completely inhibited by 
preincubation with 200 \iM unlabeled StAR signal peptide. Conversely, preincubation with 200 (iM 
of the signal peptide of ornithine transcarbamylase (OTC) or 100 ^tM of the signal peptide of the Fl- 
5 ATPase-fi subunit, two other mitochondrial proteins, did not show similar inhibitory effects on 
binding. This protein complex was then incorporated into liposomes for further examination. 
Incubation of 35 S-labeled StAR signal peptide with the liposomes resulted in a sharp peak of 
radioactivity which co-eluted with the liposome fraction recovered from a Sephacryl S-200 column. 
Preincubation of unlabeled StAR signal peptide with the liposomes reduced total radioactivity by 
|0 65% in this fraction, whereas OTC signal peptide did not reduce the binding to the liposomes. These 
y3 results suggest that the signal peptide region of the StAR protein is capable of specifically binding 
i to a mitochondrial membrane protein complex. 

C EXAMPLE 17 

15 Corticotropin-Releasing Hormone (CRH) and Testosterone - Effects of StAR 

i2 CRH treatment of MA- 1 0 mouse Leydig tumor cells results in a dose dependent stimulation 

y of testosterone production (Biol Reprod 53 :620-626 (1 995)). In view of this observation, the effects 
O of CRH on the synthesis of the StAR protein in these cells was examined. 
20 

Treatment of MA- 1 0 cells with CRH resulted in a dose-dependent increase in the synthesis 
of the StAR protein with a maximal response observed at 1 jiM. The maximal response to 1 ^iM 
CRH was seen at 4 hr following stimulation. Treatment with the cAMP analog, dbcAMP, also 
resulted in a dose-dependent increase in both StAR and steroid synthesis, reaching a maximum at 
25 1 |iM. The maximum response to dbcAMP occurred at 6 hr post stimulation. While hCG treatment 
of MA-10 cells also resulted in an increase in StAR synthesis, the levels obtained were significantly 
lower than those seen with CRH or dbcAMP. CRH treatment in the presence of hCG resulted in a 
higher level of StAR synthesis than that seen with maximal doses of CRH or hCG alone. These 

-73- 



results indicate that CRH, like hCG and dbcAMP, can stimulate the synthesis of testosterone in MA- 
10 Leydig tumor cells and apparently does so by increasing the synthesis of the StAR protein. 

EXAMPLE 18 
StAR as a Cholesterol Binding Protein 

The present study demonstrates the cholesterol binding capacity of StAR and its potential 
utility for binding and hence use in the regulation and/or reduction of cholesterol 

For these studies StAR proteins were obtained from three sources; (1) MA- 10 cells 
stimulated with dbAMP; (2) monkey kidney cells (COS 1) transfected with StAR cDNA; and (3) 
expression of a recombinant pKK233-2/StAR plasmid in E. coli. Filtration radioassays and Steady- 
State (native) polyacrylamide gel electrophoresis were used to detect cholesterol binding to StAR. 
Results from these radioassays demonstrated low levels of specific binding (20-40% of the total 
binding) of cholesterol to StAR. Fluorescent sterol binding assay results support the finding that the 
binding of NBD-cholesterol to StAR occurs with an affinity in the nanomolar range. Thus, this 
study has provided evidence that cholesterol can bind to the StAR protein with high affinity. 

EXAMPLE 19 
StAR Binding Site for SF-1 AdBP4 

In the mouse, a sequence motif that matches the known requirements for binding the orphan 
nuclear receptor, SF-l/AdBP4, is located at position -128/-135 relative to the transcriptional start 
site. SF- l/AdBP4 binding sites are present in the promoters of all the steroid hydroxylase genes, and 
SF-l/AdBP4 has been shown to transcriptionally regulate their expression in a cAMP-dependent 
manner. Thus, StAR may represent another target for SF-l/AdBP4. 

All of the compositions and methods disclosed and claimed herein can be made and executed 
without undue experimentation in light of the present disclosure. While the compositions and 
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methods of this invention have been described in terms of preferred embodiments, it will be apparent 
to those of skill in the art that variations may be applied to the composition, methods and in the steps 
or in the sequence of steps of the method described herein without departing from the concept, spirit 
and scope of the invention. More specifically, it will be apparent that certain agents which are both 
chemically and physiologically related may be substituted for the agents described herein while the 
same or similar results would be achieved All such similar substitutes and modifications apparent 
to those skilled in the art are deemed to be within the spirit, scope and concept of the invention as 
defined by the appended claims. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

<i) APPLICANT: Stocco, Douglas M. 

Clark, Barbara J. 

(ii) TITLE OF INVENTION: COMPOSITIONS AND METHODS FOR REGULATION OF 

STEROIDOGENESIS 

(iii) NUMBER OF SEQUENCES: 19 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Akin, Gump, Strauss, Hauer & Feld, L*L.P. 

(B) STREET: 1900 Frost Bank Plaza, 816 Congress Avenue 

(C) CITY: Austin 

(D) STATE: TX 

(E) COUNTRY: U.S.A. 

(F) ZIP: 78701 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 

(B) COMPUTER: 

(C) OPERATING SYSTEM: 

(D) SOFTWARE: 

<vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US Unknown 

(B) FILING DATS: Concurrently Herewith 

(C) CLASSIFICATION: Unknown. 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Mayfield, Denise L. 

(B) REGISTRATION NUMBER: 33,732 

(C) REFERENCE/DOCKET NUMBER: 043375 . 0006/DLM 
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(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 512/499-6200 

(B) TELEFAX: 512/499-6290 

(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1466 base pairs 

(B) TYPE: nucleic acid 

. (C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc ~ "DNA" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

GTCGACCCAC GCGTCCGCTC AGGACCTTGA AAGGCTCAGG AAGAACAACC CTTGAGCACC 

TCAGCACTCA GCATGTTCCT CGCTACGTTC AAGCTGTGTG CTGGAAGCTC CTATAGACAT 

ATGCGGAATA TGAAAGGATT AAGGCACCAA GCTGTGCTGG CCATTGGCCA AGAGCTCAAC 

TGGAGAGCAC TGGGGGATTC CAGTCCCGGG TGGATGGGTC AAGTTCGACG TCGGAGCTCT 

CTGCTTGGTT CTCAACTGGA AGCAACACTC TATAGTGACC AGGAGCTGTC CTACATCCAG 

CAGGGAGAGG TGGCTATGCA GAAGGCCTTG GGCATACTCA ACAACCAGGA AGGCTGGAAG 

AAGGAAAGCC AGCAGGAGAA CGGGGACGAA GTGCTAAGTA AGATGGTGCC AGATGTGGGC 

AAGGTGTTTC GCTTGGAGGT GGTGGTAGAC CAGCCCATGG ACAGACTCTA TGAAGAACTT 

GTGGACCGCA TGGAGGCCAT GGGAGAGTGG AACCCAAATG TCAAGGAGAT CAAGGTCCTG 

CAGAGGATTG GAAAAGACAC GGTCATCACT CATGAGCTGG CTGCGGCGGC AGCAGGCAAC 
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CTGGTGGGGC CTCGAGACTT CGTGAGCGTG CGCTGTACCA AGCGCAGAGG TTCCACCTGT 
GTGCTGGCAG GCATGGCCAC ACATTTTGGG GAGATGCCGG AGCAGAGTGG TGTCATCAGA 
GCTGAACACG GCCCCACCTG CATGGTGCTT CATCCACTGG CTGGAAGTCC CTCCAAGACT 
AAACTCACTT GGCTGCTCAG TATTGACCTG AAGGGGTGGC TGCCGAAGAC AATCATCAAC 
CAGGTCCTAT CGCAGACCCA GATAGAGTTC GCCAACCACC TGCGCAAGCG CCTGGAAGCC 
AGCCCTGCCT CTGAGGCCCA GTGTTAAGGA CTGTCCACCA CATTGACCTG CAAATCATTG 
GAAGCTCTCA CAGGAAGCCT GCAAGTCTGT CCATCTTCAG CTAACAGCAT CGGGAGGGGT 
GGTAGTCAGG AGACACTAGG ACTGACTGGT AAAATCAGGA TCAGCAAAAT AGAAATGAGG 
CTTAGAATAA AAGTTCTCTA GTGTCTCCCA CTGCATAGCT GTGAAGGCTA AGGGATAAGT 
AGCTATGAAA CCTTTCATCT AGGCTTGTAT ATGCTGACCT AAAAGACACC AGCAGCTACG 
AACAGGGGAT GCTAAGGATC GGGAACTGTT GTCTTACCAG CTCCAAATGT CACTACCTGA 
AGGCAGTGTG CACACAAAGC AAGGTCTTGC CTAGGAAACT CTGTAAAAGT TCTCCTCTGT 
AAAAGGCCAG AACTTGAATG AAACTACCTA CAAAGGGCCT TTCCAGAGTA TTCCAACTTT 
TCTCTGAGGA GAAATGAAAC CATCATTGTG CCGACTTCCC TACTAATCCC ATGACAATAA 
AGAACATACA TAAAAAAAAA AAAAAA 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 284 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Phe Leu Ala Thr Phe Lys Leu Cys Ala Gly Ser Ser Tyr Arg His 
15 10 15 

Met Arg Asn Met Lys Gly Leu Arg His Gin Ala Val Leu Ala lie Gly 
20 25 30 

Gin Glu Leu Asn Trp Arg Ala Leu Gly Asp Ser Ser Pro Gly Trp Met 
35 40 45 

Gly Gin Val Arg Arg Arg Ser Ser Leu Leu Gly Ser Gin Leu Glu Ala 
50 55 60 

Thr Leu Tyr Ser Asp Gin Glu Leu Ser Tyr He Gin Gin Gly Glu Val 
65 70 75 80 

Ala Met Gin Lys Ala Leu Gly He Leu Asn Asn Gin Glu Gly Trp Lys 
85 90 95 

Lys Glu Ser Gin Gin Glu Asn Gly Asp Glu Val Leu Ser Lys Met Val 
100 105 110 

Pro Asp Val Gly Lys Val Phe Arg Leu Glu Val Val Val Asp Gin Pro 
115 120 125 

Met Asp Arg Leu Tyr Glu Glu Leu Val Asp Arg Met Glu Ala Met Gly 
130 135 140 
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Glu Trp Asn Pro Asn Val Lys Glu lie Lys Val Leu Gin Arg lie Gly 
145 150 155 ISO 

Lys Asp Thr Val He Thr His Glu Leu Ala Ala Ala Ala Ala Gly Asn 
165 170 175 

Leu Val Gly Pro Arg Asp Phe Val Ser Val Arg Cys Thr Lys Arg Arg 
180 185 190 

Gly Ser Thr Cys Val Leu Ala Gly Met Ala Thr His Phe Gly Glu Met 
195 200 205 

Pro Glu Gin Ser Gly Val He Arg Ala Glu His Gly Pro Thr Cys Met 
210 215 220 

Val Leu His Pro Leu Ala Gly Ser Pro Ser Lys Thr Lys Leu Thr Trp 
225 230 235 240 

Leu Leu Ser He Asp Leu Lys Gly Trp Leu Pro Lys Thr He He Asn 
245 250 255 

Gin Val Leu Ser Gin Thr Gin He Glu Phe Ala Asn His Leu Arg Lys 
260 265 270 

Arg Leu Glu Ala Ser Pro Ala Ser Glu Ala Gin Cys 

275 280 

INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

Ala Glu His Gly Pro Thr Cys Met Val Leu His Pro Leu Ala 
1 5 10 

) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 
' (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Ala Leu Gly lie Leu Asn Asn Gin Glu Gly Trp Lys 
15 10 



) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY:" linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 
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Gly Ser Thr Cys Val Leu Ala Gly Met Ala Thr His Phe Gly Glu Met 
15 10 15 



Pro Glu Gin 



2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Asn Gin Glu Gly Trp Lys 
1 5 



) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



-86- 



Ala Glu His Gly Pro Thr Cys Met Val 
1 5 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

lie Leu Asn Asn Gin Glu Gly Trp Lys Lys Glu 
15 10 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc * "DNA" 

(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: one-of<3, 12, 15, 18) 

(D) OTHER INFORMATION: /mod_base» OTHER 
/note. "N « (A or C or G or T/U) or (unknown or other) " 
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<ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /mod_base= OTHER 
5 /note- "R = A or G" 

(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: one-of(9, 21) 

10 (D) OTHER INFORMATION: /mod_base= OTHER 

/note- "Y =* C or T/U H 



US 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

SB GCNGARCAYG GNCCNACNTG YATGG 25 

g " ' (2) INFORMATION FOR SEQ ID NO: 10: 

f§) (i) SEQUENCE CHARACTERISTICS: 

D (A) LENGTH: 25 base pairs 

S (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

25 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA" 

(ix) FEATURE: 
30 (A) NAME/KEY: modif ied_base 

(B) LOCATION: one-of{5, 17) 

(D) OTHER INFORMATION: /mod_base« OTHER 
/note- "R « A or G n 

35 (ix) FEATURE: 
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(A) NAME/KEY: modif ied_base 

(B) LOCATION: one-of(8, 11, 14, 23) 

(D) OTHER INFORMATION: /mod_base= OTHER 

/note* "N » (A or C or G or T/U) or (unknown or 

(ix) FEATURE: 

(A) NAME/ KEY: modif ied_base 

(B) LOCATION: 20 

(D) OTHER INFORMATION: /mod_base» OTHER 
/note* 11 Y a C or T/U" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

CCATRCANGT NGGNCCRTGY TCNGC 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc - "DNA* 

(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /mod_base« OTHER 
/note- "Y C or T/U n 

(ix) FEATURE: 

(A) NAME/KEY: modified base 
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(B) LOCATION: one-of(6, 9) 
(D) OTHER INFORMATION: /mod_base* OTHER 
/note= "R =» A or G" 

(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /mod_base* OTHER 
/note= "N = (A or C or G or T/U) or (unknown or other) " 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

AAYCARCARG GNTGGAA 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc =* "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

TTCCANCCYT CYTGRTT 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 401 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

AACCAGGAAG GCTGGAAGAA GGAAAGCCAG CAGGAGAACG GGGACGAAGT GCTAAGTAAG 

ATGGTGCCAG ATGTGGGCAA GGTGTTTCGC TTGGAGGTGG TGGTAGACCA GCCCATGGAC 

AGACTCTATG AAGAACTTGT GGACCGCATG GAGGCCATGG GAGAGTGGAA CCCAAATGTC 

AAGGAGATCA AGGTCCTGCA GAGGATTGGA AAAGACACGG TCATCACTCA TGAGCTGGCT 

GCGGCGGCAG CAGGCAACCT GGTGGGGCCT CGAGACTTCG TGAGCGTGCG CTGTACCAAG 

CGCAGAGGTT CCACCTGTGT GCTGGCAGGC ATGGCCACAC ATTTTGGGGA GATGCCGGAG 

CAGAGTGGTG TCATCAGAGC TGAACACGGC CCCACCTGCA T 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1466 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc « "RNA" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

GUCGACCCAC GCGUCCGCUC AGGACCUUGA AAGGCUCAGG AAGAACAACC CUUGAGCACC 60 

5 UCAGCACUCA GCAUGUUCCU CGCUACGUUC AAGCUGUGUG CUGGAAGCUC OJAUAGACAU 120 

AUGCGGAAUA UGAAAGGAUU AAGGCACCAA GCUGUGCUGG CCAUUGGCCA AGAGCUCAAC 180 

UGGAGAGCAC UGGGGGAUUC CAGUCCCGGG UGGAUGGGUC AAGUUCGACG UCGGAGCUCU 240 

CUGCUUGGUU CUCAACUGGA AGCAACACUC UAUAGUGACC AGGAGCUGUC CUACAUCCAG 300 

1: CAGGGAGAGG UGGCUAUGCA GAAGGCCUUG GGCAUACUCA ACAACCAGGA AGGCUGGAAG 360 

J|5 AAGGAAAGCC AGCAGGAGAA CGGGGACGAA GUGCUAAGUA AGAUGGUGCC AGAUGUGGGC 420 

^ AAGGUGUUUC GCUUGGAGGU GGUGGUAGAC CAGCCCAUGG ACAGACUCUA UGAAGAACUU 480 

%J GUGGACCGCA UGGAGGCCAU GGGAGAGUGG AACCCAAAUG UCAAGGAGAU CAAGGUCCUG 540 

jo 

O CAGAGGAUUG GAAAAGACAC GGUCAUCACU CAUGAGCUGG CDGCGGCGGC AGCAGGCAAC 600 

CUGGUGGGGC CUCGAGACUU CGUGAGCGUG CGCUGUACCA AGCGCAGAGG UUCCACOJGU 660 

25 GUGCUGGCAG GCAUGGCCAC ACAUUUUGGG GAGAUGCCGG AGCAGAGUGG UGUCAUCAGA 

GCUGAACACG GCCCCACCUG CAUGGUGCUU CAUCCACUGG CUGGAAGUCC CUCCAAGACU 780 

AAACUCACUU GGCUGCUCAG UAUUGACCUG AAGGGGUGGC UGCCGAAGAC AAUCAUCAAC 840 

30 

CAGGUCCUAU CGCAGACCCA GAUAGAGUUC GCCAACCACC UGCGCAAGCG CCUGGAAGCC 900 

AGCCCUGCCU CUGAGGCCCA GUGUUAAGGA CUGUCCACCA CAUUGACCTG CAAAUCAUUG 960 
35 GAAGCUOJCA CAGGAAGCCU GCAAGUCUGU CCAUCUUCAG CUAACAGCAU CGGGAGGGGU 



720 



1020 



-92- 



1140 



GGUAGUCAGG AGACACUAGG ACUGACUGGU AAAAUCAGGA UCAGCAAAAU AGAAAUGAGG 108 0 
CUUAGAAUAA AAGUUCUCUA GUGUCUCCCA CUGCAUAGCU GUGAAGGCUA AGGGAUAAGU 

AGCUAUGAAA CCUUUCAUCU AGGCUUGUAU AUGCUGACCU AAAAGACACC AGCAGCUACG 1200 

AACAGGGGAU GCUAAGGAUC GGGAACUGUU GUCUUACCAG CUCCAAAUGU CACUACCUGA 1260 

AGGCAGUGUG CACACAAAGC AAGGUCUUGC CUAGGAAACU CUGUAAAAGU UCUCCUOJGU 1320 

AAAAGGCCAG AACUUGAAUG AAACUACCUA CAAAGGGCCU UUCCAGAGUA UUCCAACUUU 1380 

UCUCUGAGGA GAAAUGAAAC CAUCAUUGUG CCGACUUCCC UACUAAUCCC AUGACAAUAA 1440 

AGAACAUACA UAAAAAAAAA AAAAAA 1466 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 

(B) TYPE: 

(C) STRANDEDNESS: 

(D) TOPOLOGY: 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: 

SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

267 GTGGATTAAC CAGGTTCGGC GGCGGAGCTC TCTACTCGGT TCTCGGCTGG AAGAGACTCT 326 
327 CTACAGTGAC CAGGAGCTGG CCTATCTCCA GCAGGGGGAG GAGGCCATGC AGAAGGCCTT 386 
387 GGGCATCCTT AGCAACCAAG AGGGCTGGAA GAAGGAGAGT CAGCAGGACA ATGGGGACAA 446 
447 AGTGATGAGT AAAGTGGTCC CAGATGTGGG CAAGGTGTTC CGGCTGGAGG TCGTGGTGGA 506 
507 CCAGCCCATG GAGAGGCTCT ATGAAGAGCT CGTGGAGCGC ATGGAAGCAA TGGGGGAGTG 566 
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567 GAACCCCAAT GTCAAGGAGA TCAAGGTCCT 
627 TCACGAGCTG GCTGCCGAGG CAGCAGGAAA 
687 GCGCTGTGCC AAGCGCCGAG GCTCCACCTG 
747 GAACATGCCT GAGCAGAAGG GTGTCATCAG 
807 TCACCCGTTG GCTGGAAGTC CCTCTAAGAC 
867 CAAGGGGTGG CTGCCCAAGA GCATCATCAA 
927 TGCCAACCAC CTGCGCAAGC GCCTGGAGTC 
987 AC 988 



GCAGAAGATC GGAAAAGATA CATTCATTAC 626 
CCTGGTGGGG CCCCGTGACT TTGTGAGCGT 686 
TGTGCTGGCT GGCATGGACA CAGACTTCGG 746 
GGCGGAGCAC GGTCCCACTT GCATGGTGCT 806 
CAAACTTACG TGGCTACTCA GCATCGACCT 866 
CCAGGTCCTG TCCCAGACCC AGGTGGATTT 926 
CCACCCTGCC TCTGAAGCCA GGTGTTGAAG 986 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 

(B) TYPE: 

(C) STRANDEDNESS : 

(D) TOPOLOGY: 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: 

SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

127 ATGCTGCTAG CGACATTCAA GCTGTGCGCT GGGAGCTCCT ACAGACACAT GCGCAACATG 186 
187 AAGGGGCTGA GGCAACAGGC TGTGATGGCC ATCAGCCAGG AGCTGAACCG GAGGGCCCTG 246 
247 GGGGGCCCCA CCCC 260 



(2) INFORMATION FOR SEQ ID NO:17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 

(B) TYPE: 

(C) STRANDEDNESS: 

(D) TOPOLOGY: 
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30 



(ii) MOLECULE TYPE: 

(A) DESCRIPTION: 

SEQUENCE DESCRIPTION: SEQ ID NO: 17; 

1051 ACTGGAAGCC TGCAAGTCT 1069 



(2) INFORMATION FOR SEQ ID NO: 18: 

10 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 
% (B) TYPE : 

EP (C) STRANDEDNESS : 

15 (D) TOPOLOGY: 



~ (ii) MOLECULE TYPE: 

(A) DESCRIPTION: 

2p SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

!f 1 MLLATFKLCA GSSYRHMRNM KGLRQQAVMA ISQELNRRAL GGPTPSTWIN 50 

51 QVRRRSSLLG SRLEETLYSD QELAYLQQGE EAMQKALGIL SNQEGWKKES 100 

101 QQDNGDKVMS KWPDVGKVP RLEWVDQPM ERLYEELVER MEAffeEWNPN 150 

25 151 VKEIKVLQKI GKDTFITHEL AAEAAGNLVG PRDFVSVRCA KRRGSTCVLA 200 

201 GMATDFGNMP EQKGVIRAEH GPTCMVLHPL AGSPSKTKLT WLLSIDLKGW 250 

251 LPKSIINQVL SQTQVDFANHL RKRLESHPA SEARC 285 



(2) INFORMATION FOR SEQ ID NO: 19: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 

(B) TYPE: 

(C) STRANDEDNESS: 
35 (D) TOPOLOGY: 
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(ii) MOLECULE TYPE: 

(A) DESCRIPTION: 

SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

agaacaccag gtccaggctg cagctgcggg actcagaggc gaacgttgag gggctcagga 60 
aggacgaaga accacccttg agagaagagg cagcagcagc gcggcagcag cagcggcagc 120 
gaccccacca ctgccacatt tgccaggaaa caATGCTGCT AGCGACATTC AAGCTGTGCG 180 
CTGGGAGCTC CTACAGACAC ATGCGCAACA TGAAGGGGCT GAGGCAACAG GCTGTGATGG 240 
CCATCAGCCA GGAGCTGAAC CGGAGGGCCC TGGGGGGCCC CACCCCTAGC ACGTGGATTA 300 
ACCAGGTTCG GCGGCGGAGC TCTCTACTCG GTTCTCGGCT GGAAGAGACT CTCTACAGTG 360 
ACCAGGAGCT GGCCTATCTC CAGCAGGGGG AGGAGGCCAT GCAGAAGGCC TTGGGCATCC 420 
TTAGCAACCA AGAGGGCTGG AAGAAGGAGA GTCAGCAGGA CAATGGGGAC AAAGTGATGA 480 
GTAAAGTGGT CCCAGATGTG GGCAAGGTGT TCCGGCTGGA GGTCGTGGTG GACCAGCCCA 540 
TGGAGAGGCT CTATGAAGAG CTCGTGGAGC GCATGGAAGC AATGGGGGAG TGGAACCCCA 600 
ATGTCAAGGA GATCAAGGTC CTGCAGAAGA TCGGAAAAGA TACATTCATT ACTCACGAGC 660 
TGGCTGCCGA GGCAGCAGGA AACCTGGTGG GGCCCCGTGA CTTTGTGAGC GTGCGCTGTG 720 
CCAAGCGCCG AGGCTCCACC TGTGTGCTGG CTGGCATGGC CACAGACTTC GGGAACATGC 780 
CTGAGCAGAA GGGTGTCATC AGGGCGGAGC ACGGTCCCAC TTGCATGGTG CTTCACCCGT 840 
TGGCTGGAAG TCCCTCTAAG ACCAAACTTA CGTGGCTACT CAGCATCGAC CTCAAGGGGT 900 
GGCTGCCCAA GAGCATCATC AACCAGGTCC TGTCCCAGAC CCAGGTGGAT TTTGCCAACC 960 
ACCTGCGCAA GCGCCTGGAG TCCCACCCTG CCTCTGAAGC CAGGTGTTGA agaccagcct 1020 
gctgttccca actgtgccca gctgcactgg tacacacgct catcaggaga atccctactg 1080 
gaagcctgca agtctaagat ctccatctgg tgacagtggg atgggtgggg ttcgtgttta 1140 
gagtatgaca ctaggattca gattggtgaa agtttttagt accaagaaaa cagggatgag 1200 
ctcttggatt aaaaggtaac ttcattcact gattagctat gacatgaggg ttcaggcccg 1260 
ctaaaaataa ttgtaaaact ttttttctgg gcccttatgt acccacctaa aaccatcttt 1320 
aaaatgctag tggctgatat gggtgtgggg gatgctaacc acagggcctg agaagtcttg 1380 
ctttatgggc tcaagaatgc catgcgctgg cagtacatgt gcacaaagca gaatctcaga 1440 
gggtctcctg cagccctctg ctcctcccgg ccgctgcaca gcaacaccac agaacaagca 1500 
gcaccccaca gtgggtgcct tccagaaata tagtccaagc tttctctgtg gaaaaagaca 1560 
aaactcatta gtagacatgt ttccctattg ctttcatagg caccagtcag aataaagaat 1620 
cataattcac acaaaaaaaa a 
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WHAT IS CLAIMED TS: 



1 . A purified nucleic acid molecule having a nucleotide sequence encoding a steroidogenic 

acute regulatory protein, the protein having an amino acid sequence having between 80% and 99% 
identity to the sequence of SEQ ID NO: 2. 

2- A purified nucleic acid molecule complementary to the nucleic acid molecule of claim 

1. 

3 • The purified nucleic acid molecule of claim 1 wherein the steroidogenic acute regulatory 

protein has the amino acid sequence of SEQ ID NO: 2, 

4. The purified nucleic acid molecule of claim 1 further defined as having a nucleotide 
sequence with about 85% to about 99% identity to the nucleic acid sequence of SEQ ID NO: 1, 

5. The purified nucleic acid molecule of claim 1 further defined as a DNA molecule and 
being substantially free of DNA molecules not encoding a steroidogenic acute regulatory protein. 

6. A recombinant vector comprising the purified nucleic acid molecule of claim 1 . 

7. The recombinant vector of claim 6, further defined as an expression vector comprising 
a promoter operatively linked to said nucleic acid molecule. 

8. The recombinant vector of claim 6, further defined as a pCMV, adenoviral, retroviral, 
pUC, SV40, yeast plasmid, Bacidovirus or Vaccinia virus vector. 

9. A recombinant host cell comprising the recombinant vector of claim 6. 



-97- 



10. The recombinant host cell of claim 9, further defined as a Leydig cell, a COS cell, an 
adrenalcortical cell, an ovarian granulosa cell, Saccharomyces cerevisiae, or Escherichia coli cell. 

11. A purified nucleic acid molecule having a nucleotide sequence that encodes an amino 
acid sequence extending from amino acid methionine at position 48 through amino acid cysteine at 
position 284 essentially as set forth in SEQ ID NO:2. 



12. A purified nucleic acid molecule comprising a sequence that corresponds to, or is capable 
of hybridizing to the nucleic acid sequence of SEQ ID NO:l under conditions standard for 
hybridization fidelity and stability or degenerate variants thereof. 

13. A purified nucleic acid molecule having a nucleotide sequence of SEQ ID NO:9, SEQ 
ID NO: 1 0, SEQ ID NO: 1 1 , SEQ ID NO: 12, SEQ ID NO: 1 3, or SEQ ID NO: 14. 

14. A purified steroidogenic acute regulatory protein having an amino acid sequence as set 
forth in SEQ ID NO:2. 

15. A purified steroidogenic acute regulatory protein having an amino acid sequence 
extending from amino acid methionine at position 48 through amino acid cysteine at position 284 
in SEQ ID NO:2. 

1 6. The purified steroidogenic acute regulatory protein of claim 14 defined further as being 
phosphorylated. 

1 7. A purified polypeptide having an amino acid sequence in accordance with SEQ ID NO:3, 
SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, or SEQ ID NO:8. 

1 8. A method of preparing a steroidogenic acute regulatory protein encoded by the purified 
nucleic acid molecule of claim 1, comprising: 
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preparing a recombinant host bearing the nucleic acid molecule of claim 1, the host 
being capable of expressing the protein; 

culturing the recombinant host to produce steroidogenic acute regulatory protein; and 

collecting the steroidogenic acute regulatory protein having an amino acid sequence with 
about 80% to about 99% identity to the amino acid sequence. 

19. The method of claim 1 8 wherein the recombinant host is a COS cell. 

20. A purified nucleic acid molecule having a nucleotide sequence encoding a steroidogenic 
acute regulatory protein, the protein having about 80% to about 99% amino acid sequence identity 
to SEQ ID NO:2, said nucleic acid molecule obtained by a process of: 

preparing oligonucleotides that encodes a StAR peptide having about 80% to about 99% 
amino acid sequence homology to SEQ ID NO:2 ; 

screening an animal cell DNA library with said oligonucleotides; and 

obtaining the purified nucleic acid molecule having a nucleotide sequence encoding a 
steroidogenic acute regulatory protein, the protein having an amino acid sequence 
with about 80% to about 99% amino acid sequence identity to SEQ ID NO:2. 

21. An antibody having binding specificity for the steroidogenic acute regulatory protein 
encoded by the nucleic acid molecule of claim 1. 

22. The antibody of claim 2 1 defined further as having binding specificity for amino acids 
1-26, 10-26, 36-47, or 88-98 of SEQ ID NO:2. 
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23. The antibody of claim 2 1 further defined as a polyclonal antibody. 

24. An anti-StAR antibody, said antibody prepared by a process of: 

immunizing an animal with an immunogen comprising a peptide having an amino acid 
sequence as defined in SEQ ID NO: 8 to provide an immunized animal that 
produces antibody capable of binding with a StAR peptide having an amino acid 
sequence as defined in SEQ ID NO:2; 

obtaining sera from said immunized animal to provide immune sera; and 

collecting anti-StAR antibodies from said immune sera. 

25. A hybridoma cell line that produces an antibody having binding specificity for the 
steroidogenic acute regulatory protein encoded by the nucleic acid molecule of claim 1 . 

26. The hybridoma cell line of claim 25 prepared.by a process of: 

immunizing an animal with a steroidogenic acute regulatory protein having an amino 
acid sequence of SEQ ID NO:8 or having an amino acid sequence with 80% to 
95% homology to SEQ ID NO:2; 

collecting anti-steroidogenic acute regulatory protein antibody producing cells from the 
immunized animal ; and 

fusing the antibody producing ceils with a neoplastic animal cell line to obtain an 
hybridoma cell line. 

27. An immunoassay for the detection of a steroidogenic acute regulatory protein in a 
biological sample, comprising: 
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preparing an antibody as defined in claim 24; 



incubating the antibody with the biological sample for a sufficient time to permit binding 
between antibody and steroidogenic acute regulatory protein present in said 
biological sample to form an incubate; and 

determining the presence of bound antibody by contacting the incubate with a detectabiy 
labelled antibody specific for the anti-steroidogenic acute regulatory protein 
antibody of claim 24, 

wherein the presence of anti-steroidogenic acute regulatory protein antibody in the biological sample 
is detectable as a measure of the detectabiy labelled antibody from the biological sample. 

28. The immunoassay of claim 27 wherein the detectabiy labelled antibody is an enzyme- 
linked antibody, a fluorescent tagged antibody or a radiolabeled antibody. 

29. A method for screening for a chromosomal genetic lesion comprising: 

preparing a nucleic acid probe having a nucleotide sequence having about 80% to about 
99% homology to SEQ ID NO: 1 ; and 

contacting a chromosomal sample with the probe to allow hybridization of the sample 
to the probe under conditions standard for hybridization fidelity and stability; 

wherein lack of specific hybridization of the probe and the chromosomal sample indicates a 
chromosomal genetic lesion. 

30. The method of claim 29 wherein the genetic lesion is a deletion, a rearrangement, an 
insertion, a transition, a transversion, a ftameshift, a missense or a nonsense mutation* 
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31. The method of claim 29 wherein the genetic lesion correlates with the presence of lipoid 
congenital adrenal hyperplasia. 

32. The method of claim 29 wherein the chromosomal sample is from adrenal tissue, gonadal 
tissue, or blood. 



33 . A screening method for lipoid congenital adrenal hyperplasia comprising 
obtaining a chromosomal sample to provide a test sample; 

preparing a nucleic acid probe having a nucleotide sequence essentially as set forth in 
SEQ ID NO: 1 or a sequence with 80% to 99% identity thereto; 

contacting the test sample with the nucleic acid probe under hybridization conditions 
allowing for detection of a mismatch in a hybridizing molecule as a screening 
method for lipoid congenital adrenal hyperplasia. 

34. A method for stimulating cholesterol transport comprising administering a 
pharmacologically effective amount of steroidogenic acute regulatory protein having an amino acid 
sequence with 80% to 99% identity to the sequence of SEQ ID NO:2. 



35. A method for increasing production of progesterone comprising administering a 
pharmacologically effective amount of steroidogenic acute regulatory protein having an amino acid 
sequence with 80% to 99% homology to SEQ ID NO:2. 

36. A method for increasing steroidogenesis comprising adrninistering a pharmacologically 
effective amount of steroidogenic acute regulatory protein having an amino acid sequence with 80% 
to 99% identity to SEQ ID NO:2. 
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COMPOSITIONS AND METHODS FOR REGULATION OF STEROIDOGENESIS 



Abstract of the Disclosure 



Compositions and methods relating to the regulation of transport of cholesterol into the 
mitochondria of a cell and, therefore, for the regulation of steroidogenesis are provided. 
Compositions include nucleic acid molecules encoding a steroidogenic acute regulatory protein 
(StAR), StAR protein molecules and peptides having amino acid sequences as disclosed herein, and 
anti-StAR antibodies. Methods include immunoassays using anti-StAR antibodies and nucleic acid 
based screening methods for pathologies correlated with defects in StAR, such as lipoid congenital 
adrenal hyperplasia and dose sensitive sex reversal. In addition, these compositions an^ 
may be useful for treatment of steroid hormone-dependent disorders, in parti^ 
congenital adrenal hyperplasia. 
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1 gtcgacccacgcgtccgctcaggaccttgaaaggctcaggaagaacaaccctTGAgcacc 
61 tcagcactcagcATGTTCCTCGCTACGTTCAAGCTGTGTGCTGGAAGCTCCTATAGACAT 

MetPheLeuAlaThrPheLysLeuCysAlaGlySerSerTyrArgHis 15 

121 ATGCGGAATATGAAAGGATTAAGGCACCAAGCTGTGCTGGCCATTGGCCAAGAGCTCAAC 

MetArgAsnMetLysGlyLeuArgHisGlnAlaValLeuAlalleGlyGlnGluLeuAsn 36 

181 TGGAGAGCACTGGGGGATTCCAGTCCCGGGTGGATGGGTCAAGTTCGACGTCGGAGCTCT 

TrpArgAlaLeuGlyAspSerSerProGlyTrpMetGlyGlnValArgArgArgSerSer 56 

241 CTGCTTGGTTCTCAACTGGAAGCAACACTCTATAGTGACCAGGAGCTGTCCTACATCCAG 

LeuLeuGlySerGlnLeuGluAlaThrLeuTyrSerAspGlnGluLeuSerTyrlleGln 7 6 

301 CAGGGAGAGGTGGCTATGCAGAAGGCCTTGGGCATACTCAACA^CCAGGMGGCTGGAAG 

Gln^l 1^-1 " Va 1 *t »Mqff3l nT.ysAl aT.pnfllvIleLeuAsnAsnGlnGluGlvTrpLvs 9 6 

25 

361 AAGGAAAGCCAGCAGGAGAACGGGGACGAAGTGCTAAGTAAGATGGTGCCAGATGTGGGC 

LysGluSerGlnGlnGluAsnGlyAspGluValLeuSerLysMetValProAspValGly 116 

421 AAGGTGTTTCGCTTGGAGGTGGTGGTAGACCAGCCCATGGACAGACTCTATGAAGAACTT - 
LysValPheArgLeuGluValValValAspGlnProMetAspArgLeuTyrGluGluLeu 136 

481 GTGGACCGCATGGAGGCCATGGGAGAGTGGAACCCAAATGTCAAGGAGATC AAGGTCCTG 

ValAspArgMetGluAlaMetGlyGluTrpAsnProAsnValLysGluIleLysValLeu 156 

541 CAGAGGATTGGAAAAGACACGGTCATCACTCATGAGCTGGCTGCGGCGGCAGCAGGCAAC 

GlnArglleGlyLysAspThrVallleThrHisGluLeuAlaAlaAlaAlaAlaGlyAsn 17 6 

601 CTGGTGGGGCCTCGAGACTTCGTGAGCGTGCGCTGTACCAAGCGCAGAGGTTCCACCTGT 

LeuValGlyProArgAspPheValSerValArgCysThrLysArgArgGlySerThrCYS 196 

661 ^r^Tr.r.r&r^r&Tr^rr&rACATTTTGGGGAGATGCCGGAGCAGAGTGGTGTCATCAGA 

v a U.miAlaGlvMftf.Al»Thr H<aPhftGlvGl»MetProGluGlnSerGlvValIleArg 216 

45 

721 nrTGAArAGGGT rrrArCTGCATGGTGCTTCATCCACTGGCTGGAAGTCCCTCCAAGACT 

AlaGluHisGlvPrnThrCvs MPfValLeuHisProLeuAlaGlvSerProSerLysThr 23 6 

23 

781 AAACTCACTTGGCTGCTCAGTATTGACCTGAAGGGGTGGCTGCCGAAGACAATCATCAAC 

LysLeuThrTrpLeuLeuSerlleAspLeuLysGlyTrpLeuProLysThrllelleAsn 256 

841 CAGGTCCTATCGCAGACCCAGATAGAGTTCGCCAACCACCTGCGCAAGCGCCTGGAAGCC 

GlnValLeuSerGlnThrGlnlleGluPheAlaAsnHisLeuArgLy sArgLeuGluAla 276 

901 AGCCCTGCCTCTGAGGCCCAGTGTTAAggactgtccaccacattgacctgcaaatcattg 

SerProAlaSerGluAlaGlnCysEnd 284 

961 gaagctctcacaggaagcctgcaagtctgtccatcttcagctaacagcatcgggaggggt 

1021 ggtagtcaggagacactaggactgactggtaaaatcaggatcagcaaaatagaaatgagg 

1081 cttagaataaaagttctctagtgtctcccactgcatagctgtgaaggctaagggataagt 

1141 agctatgaaacctttcatctaggcttgtatatgctgacctaaaagacaccagcagctacg 

1201 aacaggggatgctaaggatcgggaactgttgtcttaccagctccaaatgtcactacctga 

1261 aggcagtgtgcacacaaagcaaggtcttgcctaggaaactctgtaaaagttctcctctgt 

1321 aaaaggccagaacttgaatgaaactacctacaaagggcctttccagagtattccaacttt 

1381 tctctgaggagaaatgaaaccatcattgtgccgacttccctactaatcccatgacAATAA 

1441 AgaacatacatAAAAAAAAAAAAAAA 



FIG. 3 



Human steroidogenic acute regulatory protein (StAR) mRNA, 
Length =» 1605 



Identities - 104/134 (77% identity), mouse segment 73-206 to human segment 127- 
260; 

Mouse: 73* ATGTTCCTCGCTACGTTCAAGCTGTGTGCTGGAAGCTCCTATAGACATATGCGGAATATG 132 

in i ii ii ii liiiiiniii him iiiiiiii iimi inn n m 

Human: 127 ATGCTGCTAGCGACATTCAAGCTGTGCGCTGGGAGCTCCTACAGACACATGCGCAACATG 186 
Mouse: 133 AAAGGATTAAGGCACCAAGCTGTGCTGGCCATTGGCCAAGAGCTCAACTGGAGAGCACTG 192 

ii ii i urn ii mm imm im mil m mi u m 

Human: 187 AAGGGGCTGAGGCAACAGGCTGTGATGGCCATCAGCCAGGAGCTGAACCGGAGGGCCCTG 246 

Mouse: 193 GGGGATTCCAGTCC 206 

III! III II 
Human: 247 GGGGGCCCCACCCC 260 



Identities = 
988; 



612/722 (84% identity) ; mouse segment 210-931 to human segment 267- 



Mouse: 210 GTGGATGGGTCAAGTTCGACGTCGGAGCTCTCTGCTTGGTTCTCAACTGGAAGCAACACT 269 

mm ii urn 11 iimiimi n mini mini 11 n 

Human: 267 GTGGATTAACCAGGTTCGGCGGCGGAGCTCTCTACTCGGTTCTCGGCTGGAAGAGACTCT 326 



Mouse: 270 

Human: 327 

Mouse: 330 

Human: 387 

Mouse: 390 

Human: 447 

Mouse: 450 

Human: 507 



CTATAGTGACCAGGAGCTGTCCTACATCCAGCAGGGAGAGGTGGCTATGCAGAAGGCCTT 329 

III i i 1 1 1 1 ! I M 1 1 11 1 I III limillll I III III Ii III M Ii Mill 

CTACAGTGACCAGGAGCTGGCCTATCTCCAGCAGGGGGAGGAGGCCATGCAGAAGGCCTT 386 
GGGCATACTCAACAACCAGG7VAGGCTGGAAGAAGGAAAGCCAGCAGGAGAACGGGGACGA 389 

mm ii i mm 11 immimiii n mum n mm 1 

GGGCATCCTTAGCAACCAAGAGGGCTGGAAGAAGGAGAGTCAGCAGGACAATGGGGACAA 446 
AGTGCTAAGTAAGATGGTGCCAGATGTGGGCAAGGTGTTTCGCTTGGAGGTGGTGGTAGA 449 

1 1 1 1 i inn iii i i ii 1 1 1 1 1 1 1 m 1 1 1 1 1 1 1 1 ii imm mil 11 

AGTGATGAGTAAAGTGGTCGCAGATGTGGGCAAGGTGTTCCGGCTGGAGGTCGTGGTGGA 506 
CCAGCCCATGGACAGACTCTATGAAGAACTTGTGGACCGCATGGAGGCCATGGGAGAGTG 509 

imiimm n mimim n urn mum n inn urn 

CCAGCCCATGGAGAGGCTCTATGAAGAGCTCGTGGAGCGCATGGAAGCAATGGGGGAGTG 566 



Mouse: 510 
Human: 567 



GAACCCAAATGTCAAGGAGATCAAGGTCCTGCAGAGGATTGGAAAAGACACGGTCATCAC 569 

mm mmiMiiiiimimmmi m iiiiiiii ii mm ii 

GAACCCCAATGTCAAGGAGATCAAGGTCCTGCAGAAGATCGGAAAAGATACATTCATTAC 626 



FIG. 3, cont. 



Mouse : 


570 


TCATGAGCTGGCTGCGGCGGCAGCAGGCAACCTGGTGGGGCCTCGAGACTTCGTGAGCGT 

IN llllllllill 1 lllllllll IIIIMIIIMIII II Mill llllllll 

TCACGAGCTGGCTGCCGAGGCAGCAGGAAACCTGGTGGGGCCCCGTGACTTTGTGAGCGT 


629 


Human : 


627 


686 


Mouse : 


630 


GCGCTGTACCAAGCGCAGAGGTTCCACCTGTGTGCTGGCAGGCATGGCCACACATTTTGG 


689 


Human : 


687 


IMIMI llllllll MM II 1 1 II II 1 M 1 II 1 1 1 lllllll MM 1 II II 

GCGCTGTGCCAAGCGCCGAGGCTCCACCTGTGTGCTGGCTGGCATGGACACAGACTTCGG 


746 


Mouse: 


690 


GGAGATGCCGGAGCAGAGTGGTGTCATCAGAGCTGAACACGGCCCCACCTGCATGGTGCT 

| | Mill lllllll II II II II II 1 II II Mill Mill MIIMIMM 
GAACATGCCTGAGCAGAAGGGTGTCATCAGGGCGGAGCACGGTCCCACTTGCATGGTGCT 


749 


Human : 


747 


806 


Mouse: 


750 


TCATCCACTGGCTGGAAGTCCCTCCAAGACTAAACTCACTTGGCTGCTCAGTATTGACCT 

Ml || || II II 1 II 1 1 1 II 1 1 Mill Mill II Mill Mill II Mill 

TCACCCGTTGGCTGGAAGTCCCTCTAAGACCAAACTTACGTGGCTACTCAGCATCGACCT 


809 


Human : 


807 


866 


Mouse : 


810 


GAAGGGGTGGCTGCCGAAGACAATCATCAACCAGGTCCTATCGCAGACCCAGATAGAGTT 
• i i i i i i t i i i i i i l l l 1 t 1 1 1 I ! 1 1 ! 1 1 1 i 1 1 1 1 11 1 1 1 1 1 M 1 1 1 II II 

1 1| II II 1 II 1 II II M II 1 II M II II 1 II 1 II lllllllll 1 1 1 11 

CAAGGGGTGGCTGCCCAAGAGCATCATCAACCAGGTCCTGTCCCAGACCCAGGTGGATTT 


869 


riuman - 


o o / 


926 


Mouse : 


870 


CGCCAACCACCTGCGCAAGCGCCTGGAAGCCAGCCCTGCCTCTGAGGCCCAGTGTTAAGG 

| M II II II II 1 II II II M 1 II 1 1 1 M IMIMMMM III Mill 1 1 

TGCCAACCACCTGCGCAAGCGCCTGGAGTCCCACCCTGCCTCTGAAGCCAGGTGTTGAAG 


929 


Human : 


927 


986 


Mouse : 


930 


AC 931 




Human : 


987 


II 

AC 988 





Identities = 18/19 (94% identity), mouse segment 970-988 to human segment 105 
1069 

Mouse: 970 ACAGGAAGCCTGCAAGTCT 988 

II II II II II II II II M 
Human: 1051 ACTGGAAGCCTGCAAGTCT 1069 
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KN93 13 MM) 
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CI C2 

Bovine 1 MLLATFKLCA GSSYRHvRsM KGLqQQAVLA IGQELNRRAL GGPaPaaWIN 50 
Human MLLATFKLCA GSSYRHMRNM KGLRQQAVMA IsQELNRRAL GGPtPstWIN 

Mouse MFLATFKLCA GSSYRHMRNM KGLRhQAVLA IGQELNWRAL GdssPg.Wmg 

Ovine 

Consensus MLLATFKLCA GSSYRHMRNM KGLRQQAVLA IGQELNRRAL GGP-P WIN 

X CK 

Bovine 51 QVRRRgSLLG SQLEDpLYSD QELAhlQQGE EAMQrALGIL JcdQEGWKKES 100 
Human QVRRRSSLLG SrLEETLYSD QELAY1QOGE EAMQKALGIL sNQEGWKKES 

Mouse QVRRRSSLLG SQLEaTLYSD QELsYIQQGE vAMQKALGIL nNQEGWKKES 

Ovine K* 2 * 

Consensu* QVRRRSSLLG SQLE-TLYSD QELAYIQQGE EAMQKALGIL -NQEGWKKES 



Bovine 101 rQaNGDEVLS KVIPDVGKVF RLEVWDQPM ERLYEELVER MEAMGEWNPN 150 
Human qQdNGDkVMS KWPDVGKVF RLEVWDQPM ERLYEELVER MEAMGEWNPN 

Mouse qQeNGDEVLS KmVPDVGKVF RLEVWDQPM DRLYEELVDR MEAMGEWNPN 

Ovine rQaNGDEVLS KVIPDVGKVF RLEVWDQPM ERLYEELVER MEAMGEWNPs 

Consensus -Q-NGDEVLS KV-PDVGKVF RLEVWDQPM ERLYEELVER MEAMGEWNPN 

A 

Bovine 151 VKEIKVLQKI GKDTVITHEL AAEvAGNLVG PRDFVSVRCT KRRGSmCVLA 200 
Human VKEIKVLQKI GKDTf ITHEL AAEAAGNLVG PRDFVSVRCa KRRGStCVLA 

Mouse VKEIKVLQrl GKDTVITHEL AAaAAGNLVG PRDFVSVRCT KRRGStCVLA 

Ovine VKEIKVLQKI GKDTIITHEL AAEAAGNLVG PRDFVrVRCT KRRGSmCVLA 

Consensus VKEIKVLQKI GKDTVITHEL AAEAAGNLVG PRDFVSVRCT KRRGS-CVLA 

CD C 

Bovine 201 GMATLYeEMP qQKGVIRAEH GPTCMVLrPL AGSPSrTKLT WLLSIDLKGW 250 
Human GMdTdFgnMP eQKGVIRAEH GPTCMVLKPL AGSPSKTKLT WLLSIDLKGW 

Mouse GMAThFgEMP eQsGVIRAEH GPTCMVLHPL AGSPSKTKLT WLLSIDLKGW 

Ovine GtATLYeEMP qQKGVIR 

Consensus GMATL--EMP -QKGVIRAEH GPTCMVLHPL AGSPSKTKLT WLLSIDLKGW 



Bovine 251 LPKTIINQVL SQTQVDFANH LRKRLEScPA 1EARC 285 

Human LPKsIINQVL SQTQVDFANH LRKRLEShPA SEARC 

Mouse LPKTIINQVL SQTQIEFANH LRKRLEasPA SEAqC 

Ovine 

Consensus LPKTIINQVL SQTQVDFANH LRKRLES-PA SEARC 
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§ 1.56(a), regarding events which occurred between the filing date of the prior application 
and the national or PCT international filing date of this application: 



08/538.960 10/4/95 Pending 

(Application Serial No.) (Filing Date) (Status) 

I hereby direct that all correspondence and telephone calls be addressed to Denise 
L. Mayfield, Akin, Gump, Strauss, Hauer & Feld, L.L.P., 1900 Frost Bank Plaza, 816 
Congress Avenue, Austin, Texas 78701, (512) 499-6200. 



I hereby declare that all statements made of my own knowledge are true and that 
all statements made on information and belief are believed to be true; and further that these 
statements were made with the knowledge that willful false statements and the like so made 
are punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the 
United States Code and that such willful false statements may jeopardize the validity of the 
application or any patent issued thereon. 



Inventor's Full Name: Douglas fcL StQcco 

(Fir;*) (Initial) , (Last) 



Inventor's Signature: 




Date: 



^/^/^Co Country of Citizenship: 



Residence Address: 



Post Office 
Address: 



(Include number, street name, city, state, and country) 



(If different from Residence Address) 



I hereby declare that all statements made of my own knowledge are true and that 
all statements made on information and belief are believed to be true; and further that these- 
statements were made with the knowledge that willful false statements and the like so made 
are punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the 
United States Code and that such willful false statements may jeopardize the validity of the 
application or any patent issued thereon. 



Inventors Full Name: Barbara, L Clark 

(First) (Initial) (Last) 



Inventor's Signature: 



^h^ /l It 

Date: 

Country of Citizenship: 

Residence Address: Tfr/i/fV //j/As Lfi 

(Include number, street name, city, state, and country) 



Post Office 
Address: 



(If different from Residence Address) 
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